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Preface 


This book is a reprinting, with minor revisions and one correction, 
of notes originally prepared by John P. Brown from the lectures given 
in 1943 by the late Professor Witold Hurewicz at Brown University. 
They were first published in mimeographed form by Brown University 
in 1943, and were reissued by the Mathematics Department of The 
Massachusetts Institute of Technology in 1956. They are now re- 
printed with the permission of Brown University and of Dr. Stefan 
Hurewicz. 

An appreciation of Witold Hurewicz by Professor Solomon Lefschetz, 
which first appeared in the Bulletin of the American Mathematical 
Society, is included-in this book, together with a bibliography of his 
published works. 

Since this book treats mainly of existence theorems, linear systems, 
and geometric aspects of nonlinear systems in the plane, a selected list 
of books on differential equations has been placed at the end of the 
volume for those interested in further reading. 

The work of the several mathematicians who prepared this new 
edition was supported by the Office of Naval Research. 


NORMAN LEVINSON 
Massachusetts Institute of Technology 


Witold Hurewicz 
In Memoriam’ 


SOLOMON LEFSCHETZ 


Last September sixth was a black day for mathematics. For on that 
day there disappeared, as a consequence of an accidental fall from 
a pyramid in Uxmal, Yucatan, Witold Hurewicz, one of the most 
capable and lovable mathematicians to be found anywhere. He had 
just attended the International Symposium on Algebraic Topology 
which took place during August at the National University of Mexico 
and had been the starting lecturer and one of the most active par- 
ticipants. He had come to Mexico several weeks before the meeting 
and had at once fallen in love with the country and its people. As a 
consequence he established from the very first a warm relationship 
between himself and the Mexican mathematicians. His death caused 
among all of us there a profound feeling of loss, as if a close relative 
had gone, and for days one could speak of nothing else. 

Witold Hurewicz was born on June 29, 1904, in Lodz, Russian 
Poland, received his early education there, and his doctorate in Vienna 
in 1926. He was a Rockefeller Fellow in 1927-1928 in Amsterdam, 
privaat docent there till 1936 when he came to this country. The 
Institute for Advanced Study, the University of North Carolina, Radia- 
tion Laboratory and Massachusetts Institute of Technology (since 1945) 
followed in succession. 


* Reprinted by permission from the Bulletin of the American Mathematical 
Society, vol. 63, no. 2, pp. 77~82 (March, 1957). 
ix 


x WITOLD HUREWICZ IN MEMORIAM 


Mathematically Hurewicz will best be remembered for his important 
contributions to dimension, and above all as the founder of homotopy 
group theory. Suffice it to say that the investigation of these groups 
dominates present day topology. 

Still very young, Hurewicz attacked dimension theory, on which he 
wrote together with Henry Wallman the book Dimension theory [39]. 
We come to this book later. The Menger-Urysohn theory, still of recent 
creation, was then in full bloom, and Menger was preparing his book 
on the subject. One of the principal contributions of Hurewicz was the 
extension of the proofs of the main theorems to separable metric spaces 
[2 to 10] which required a different technique from the basically Euclid- 
ean one of Menger and Urysohn. Some other noteworthy results ob- 
tained by him on dimension are: 

(a) A separable metric n-space (=n dimensional space) may be 
topologically imbedded in a compact metric n-space [7]. 

(b) Every compact metric n-space Y is the map of a compact metric 
zero-space X in such a manner that no point of Y has more than n + 1 
antecedents, where n cannot be lowered, and conversely where this holds 
dim Y = n.. In particular one may choose for X a linear set containing 
no interval [6]. 

(c) Perhaps his best dimension result is his proof and extension of 
the imbedding theorem of compact spaces of dimension <n in Euclid- 
ean E2,,1, which reads: A compact metric n-space X may be mapped 
into En4m (m = 1, 2, +++), so that the points which are images of k 
points of X make up a set of dimension =n — (k — 1)m [26]. 

This proposition mav also be generalized as follows: Any mapping 
f: X — En4m may be arbitrarily approximated by one behaving as 
stated. Special case: X may be mapped topologically into Eon4+1. 
Earlier proofs of this last theorem existed. The wholly original proof 
of the main theorem by Hurewicz rests upon the utilization of the space 
EX, ,, of mappings of X — E,4m, as defined by Fréchet and the proof 
that the mappings of the desired type are dense in EX, _. 

A more special but interesting dimensional result is: 

(d) Hilbert space is not a countable union of finite dimensional 
spaces [10]. 

Recall R. L. Moore’s noteworthy proposition: a decomposition of 
the two-sphere S? in upper semi-continuous continua which do not 
disconnect S? is topologically an S?._Hurewicz showed [17] that for 
S3 no such result holds and one may thus obtain topologically any com- 
pact metric space. This shows that R. L. Moore’s results describe a 


very special property of S?. 


1 Square brackets refer to the bibliography at the end. 
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Another investigation of Hurewicz marked his entrance into alge- 
braic topology. The undersigned had introduced so called LC" spaces: 
compact metric spaces locally connected in terms of images of p-spheres 
for every pn. One may introduce HLC” spaces with images of 
p-spheres replaced by integral p-cycles and contractibility to a point by 
~O in the sense of Vietoris. Hurewicz proved this very unexpected 
property: N.a.s.c. for X as above to be LC" is HLC” plus local con- 
tractibility of closed paths [33]. An analogous condition will appear 
in connection with homotopy groups. 

We come now to the four celebrated 1935 Notes on the homotopy 
groups, of the Amsterdam Proceedings [29; 30; 34; 35]. The attack is 
by means of the function spaces X*¥. Let Y be a separable metric 
space which is connected and locally contractible in the sense of Borsuk. 
Let S? denote the p-sphere. Let xo be a fixed point of $"-!, n=1, 
and yo a fixed point of Y. Let N be the subset of Y S*' consisting of 
the mappings F such that Fx» = yy. The group of the paths of N is 
the same for all components of M. It is by definition the nth homo- 
topy group z,(Y) of Y. For n = 1 it is the group of the paths of Y, 
and hence generally noncommutative but for n > 1 the groups are 
always commutative. Hurewicz proved the following two noteworthy 
propositions: 


I. When the first n — (n 2 2) homotopy groups of the space Y (same 
as before) are zero then the nth 7,(Y) is isomorphic with H,(Y), the 
nth integral homology group of Y. 

II. N.a.s.c. for a finite connected polyhedron {1 to be contractible to 
a point is 7,(11) = | and H,(11) = O for every n > 1. 

For many years only a few homotopy groups were computed suc- 
cessfully. In the last five years however great progress has been made 
and homotopy groups have at last become computable mainly through 
the efforts of J.-P. Serre, Eilenberg and MacLane, Henri Cartan, and 
John Moore. 

Many other noteworthy results are found in the four Amsterdam 
Proceedings Notes but we cannot go into them here. We may men- 
tion however the fundamental concept of homotopy type introduced 
by Hurewicz in the last note: Two spaces X, Y are said to be of the 
‘ same homotopy type whenever there exist mappings f: X > Y and 
g: YX such that gf and fg are deformations in X and Y. This con- 
cept gives rise to an equivalence and hence to equivalence classes. 
This is the best known approximation to homeomorphism, and com- 
parison according to homotopy type is now standard in topology. Iden- 
tity of homotopy type implies the isomorphism of the homology and 
homotopy groups. 
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At a later date (1941) and in a very short abstract of this Bulletin 
[40] Hurewicz introduced the concept of exact sequence whose mush- 
room like expansion in recent topology is well known. The idea rests 
upon a collection of groups G, and homeomorphism ¢, such that 


On4i on 
: — Gasr2—— Gayi > Ga > eet 


and so that the kernel of $n is ¢n41Gn42. This was applied by Hurewicz 
to homology groups and he drew important consequences from the 
scheme. 

Still another noteworthy concept dealt with by Hurewicz is that of 
fibre space. In a Note [38] written in collaboration with Steenrod 
there was introduced the concept of the covering homotopy, its exist- 
ence was established in fibre spaces, the power of the method was 
made clear. He returned to it very recently [45] to build fibre spaces 
on a very different basis. In another recent Note [46] written in col- 
laboration with Fadell there was established the first fundamental ad- 
vance beyond the theorem of Leray (1948) about the structure of 
spectral sequences of fibre spaces. 

Hurewicz made a number of excursions into analysis, principally 
real variables. A contribution of a different nature was his extension 
of G. D. Birkhoff’s ergodic theorem to spaces without invariant measure 
[42]. 

During World War II Hurewicz gave evidence of surprising versa- 
tility in distinguished work which he did for the Radiation Laboratory. 
This led among other things to his writing a chapter in the Servo 
Mechanisms series issued by the Massachusetts Institute of Technology. 

The scientific activity of Hurewicz extended far beyond his written 
papers important as these may be. One way that it manifested itself 
is through his direct contact with all younger men about him. He was 
ready at all times to listen carefully to one’s tale and to make all manner 
of suggestions, and freely discussed his and anybody else’s latest ideas. 
One of his major sources of influence was exerted through his books. 
Dimension theory [39] already mentioned is certainly the definitive 
work on the subject. One does not readily understand how so much 
first rate information could find place in so few pages. We must also 
mention his excellent lectures on differential equations [41] which has 
appeared in mimeographed form and has attracted highly favorable 
attention. 

On the human side Witold Hurewicz was an equally exceptional 
personality. A man of the widest culture, a first rate and careful lin- 
guist, one could truly. apply to him nihil homini a me alienum puto. 
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Tales were. also told of his forgetfulness—which made him all the more 
charming. Altogether we shall not soon see his equal. 
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Difterential Equations 
of the First Order 


in One Unknown Function 


PART A. 
THE CAUCHY-EULER APPROXIMATION METHOD 


1. Definitions. Direction Fields 


By a domain D in the plane we understand a connected open set of 
points; by a closed domain or region D, such a set plus its boundary points. 
The most general differential equation of the first order in one unknown 
function is 

F(x, y, y’) = 0 (1) 


where F is a single-valued function in some domain of its arguments. A 
differentiable function y(x) is a solution if for some interval of x, (x, y(x), 
y'(x)) is in the domain of definition of F and if further F(z, y(x), y'(x)) = 0. 
We shall in general assume that (1) may be written in the normal form: 


y = f(x,y) (2) 


where f(x, y) is a continuous function of both its arguments simultaneously 
in some domain D of the x-y plane. It is known that this reduction may 
1 
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be carried out under certain general conditions. However, if the reduc- 
tion is impossible, e.g., if for some (2%, Yo, Yo ) for which F(2%p, Yo, Yo) = 0 
it is also the case that [OF(Zp, Yo, Yo )]/oy’ = 0, we must for the present 
omit (1) from consideration. 

A solution or integral of (2) over the interval x) < x < 2, is a single- 
valued function y(x) with a continuous first derivative y'(x) defined on 
[p, x,] such that for 7%<¢4< 2: 


(a) (x, y(x)) is in D, whence f(x, y(x)) is defined 
(b) y'(&) = f(a, y(@)). (3) 


Geometrically, we may take (2) as defining a continuous direction field 
over D; i.e., at each point P: (x,y) of D there is defined a line whose 
slope is f(x, y); and an integral of (2) is a curve in D, one-valued in z and 
with a continuously turning tangent, whose tangent at P coincides with 
the direction at P. Equation (2) does not, however, define the most 
general direction field possible; for, if R is a bounded region in D, 
S(«, y) is continuous in R and hence bounded 


lfzy|<M, (a,y)inR (4) 


where M is a positive constant. If « is the angle between the direction 
defined by (2) and the x-axis, (4) means that « is restricted to such values 
that 

|tana| << M 


The direction may approach the vertical as P: (x,y) approaches the 
boundary C of D, but it cannot be vertical for any point P of D. 
This somewhat arbitrary restriction may be removed by considering 
the system of differential equations 
dx 


dy x 
TAP F= Oey) ©) 


where P and Q are the direction cosines of the direction at (x, y) to the 
x- and y-axes respectively and hence are continuous and bounded; and 
a solution of (5) is of the form 


z=a), y=y¥), b<t<h 


which are the parametric equations of a curve Lin D. We shall be able 
to solve (5) by methods similar to those which we shall develop for (2); 
hence we shall for the present consider only the theory of the equation (2). 


2. Approximate Solutions of the Differential Equation 


Since the physicist deals only with approximate quantities, an approxi- 
mate solution of a differential equation is just as good for his purposes as 
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an exact solution, provided the approximation is sufficiently close. The 
present section will formulate the idea of an approximate solution and 
prove that the approximation may be made arbitrarily close. Themethods 
used. will be practical even though rather crude. More important, the 
demonstration of the existence of approximate solutions will later lead 
to the proof of the existence and uniqueness of exact solutions. 


Definition 1. Let f(x, y) be continuous, (x,y) in some domain D. Let 
Ly << 4%< 2, be an interval. Then a function y(x) defined on [x;, 2] is a 
solution of y’ = f(x, y) up to the error e if: 


(a) y(x) is admissible; i.e., (x, y(x)) is in D, 4 << U< Xp. 

(b) y(x) is continuous; x,< 4% < 2p. 

(c) y(x) has a piecewise continuous derivative on [x,, %_] which may fail 
to be defined only for a finite number of points, say €,, §.,° * *s §n- 

(d) |y’(a) —f(@, y(x))| Se 4S rsrwxA#§,i=1,- ++, nm. 


Theorem 1. Let (29, Yo) be a point of D, and let the points of a rectangle 
R: |x — | <a, |y — yo] <b lie in D. Let | f(x, y)| < M, (x,y) in R. 
Then if h = min (a, b/M), there can be constructed an approximate solution 
y(x) of 

y =f@y) (1) 


over the interval |x — 2o| < h, such that y(%») = Yo, where the error « may 
be an arbitrarily small positive number. Observe that h is independent of e. 


Proof. The rectangle 
S: |x —ai<h, ly—yo|< Mh (2) 


is contained in R by the definition of h. See Fig. 1. 
Let the ¢ of the theorem be given. Since f(z, y) is continuous in S, it is 


P 


ga, rer 
Soh 
| 
|~___,_____,] @ 


Fig. 1 
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- uniformly so; i.e., given « > 0 (which we take to be the e of the theorem) 
there exists 6 > 0 such that 


for (£,9); (x,y) in S; | — 2, |g —y| <6. 
Let x,° + *,Z,_, be any set of points such that: 


(a) %<%<ay<: ++ <a, <2, = Mth; 
(6) x; — x4 <min(6, 6/M),  i=1,-- +n. (4) 


We shall construct the approximate solution on the interval 7 <24< 
2% +h; a similar process will define it on the interval 4 —h<u< %. 

The approximate solution will be a polygon constructed in the following 
fashion: from (2%, Y)) we draw a segment to the right with slope of 
S(Xo, Yo); this will intersect the line x = 2, in a point (%,%). From 
(x1, ¥,) we draw a segment to the right with slope f(2,, y,) intersecting 
X= 2, at yp; etc. The point (x,, y,) must lie in the triangle OPQ; for 
tan« = M, and |f(%, y)| << M. Likewise (a, y) lies in OPQ; etc. 
Hence the process may be continued up to 2, = 2%) + h, since the only 
way in which the process could stop would be for f(x;,, y;,) to be undefined ; 
in which case we should have |y, — Yo| > Mh contrary to construction. 
Analytically we may define y(x) by the recursion formulas 


Y(x) = Yea + (@ — a flGi-1, Y(%,-»)) (5) 


where 
Yi-1 = YX s_y), Ui y~RtcxX,, fel-r"n 


Obviously by definition y(x) is admissible, continuous, and has a piece- 
wise continuous derivative 


y (x) = f(a, YU:-1)), Lyy~oUS Ly i=1,-ccn 


which fails to be defined only at the points 2, i=1,°::,n—1. 
Furthermore, if x,.< 2% < 2, 
ly'(«) — f(a, y(x))| = |fev Yin) — fle, y(x))| (6) 


But by (4), |x — x;_,| < min (6, 6/M), and by (5) 


7) 
= :. se . M— = 
ly - Y%al SMe —2.4|5Me=6 
Hence by (3) 
[fv Ya) — f@, ¥@)| < € 


and 
|y’(x) — f(a, y(x))| < « ee as, i=1,2,--:n—-1 (7 
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Hence ¥(x) satisfies all the conditions of Def. 1, and the construction 
required by the theorem has been performed. This method of constructing 
an approximate solution is known as the Cauchy-Euler method. 

It is unnecessary to improve the value of A, since, in general, as we shall 
show, y(x) is defined in a larger interval than |x — a| <A. 


3. The Fundamental Inequality 


With a certain additional restriction upon f(x, y) we shall prove an 
inequality which will be the basis of our fundamental results. 


Definition 2. A function f(x, y) defined on a (open or closed) domain D 
is said to satisfy Lipschitz conditions with respect to y for the constant 
k > Off for every x, Yy, Y2 such that (x, Y,), (®, Y2) are in D 


In connection with this definition we shall need two theorems of 
analysis: 


Lemma 1. If f(x, y) has a partial derivative for y, bounded for all (x, y) 
in D, and D is convex (i.e., the segment joining any two points of D lies 
entirely in D), then f(x, y) satisfies a Lipschitz condition for y where the 
constant k is given by 


e 


k=] 


u.b. Le» (x, y) in D (2) 


Proof. By Rolle’s theorem there exists a number & such that 


foes) — fess) = a — 9) LS a h<t<y 


1.€., 


= 2 


| f(a, Yo) — f(x, yy)| < L.u.b. Ye — Yl 


since (x, ) is in D; whence the theorem. 


Lemma 2. If D is not convex, let D be imbedded in a larger domain D’. 
Let 6 be the distance between the boundaries C and C’ of D and D' respec- 
tively (i.e., 6 = min (PP’), P in C, P’ in C’), and let 6 > 0. Then if f(x, y) is 
continuous in D' (hence bounded by M, say, in D) and of/dy exists and is 
bounded by N in D", then f(x, y) satisfies a Lipschitz condition in D with 


respect to y for a constant 
( $) 
= max | V, — 


) 
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Proof. Let P,: (x, y;) and P,: (x, y2) be in D. 
I(x, 1) =i (x, Yo) 2M 
eS , 
Y1 — Ye b 
(b) If |y, — ye| <6, the segment P,P, lies wholly in D’. 
Sle, 1) — f(, Yo) 
Y¥1 — Ye 
Theorem 2. Let (2%, Yo) be a point of a region R in which f(x, y) is 
continuous and satisfies the Lipschitz condition for k. Let y(x), G(x) be 


admissible functions for |x — %9| < h (where h is any constant, not neces- 
sarily that of Theorem 1), satisfying 


y’ = f(z, y), |a — Z| = h 
with errors €, and €, respectively. Set 


P(e) = 9(@)—y@), eF ate (3) 


(a) If |y, — ys| > 4, 


Hence as in Lemma 1, < N. Hence the lemma. 


Then 
€ 
[p(x)| < etl?-*+! | p(wa)| + r (eres ael 1) (4) 
This is the fundamental inequality. 


Proof. We give the proof only for x < 2< 2%) +h; a similar process 
will give the proof for % —h<2x< 2%. By Def. 1, except for a finite 
number of points 
dy 


d a 
# _ fey|<a |e — pe. 0] <0 Laer csaxath 


Hence 
dy dy 


<klg—y| +e 
by the Lipschitz condition; i.e., 


#l<alp| +6 %y<B< ay +h (5) 


except for a finite number of points at which [dp(x)}/dx fails to be defined. 


Case |. Suppose p(x) #0, %<%< 2% +h. Hence, being con- 
tinuous, it has the same sign, say without loss of generality p(x) > 0. 
Then a fortiori we can write (5) in the form 


dp 


of <kpla) + « 6) 
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This may be written as 
e™ [p'(x) — kp(@)] < ee™ 


or, integrating from x, to x, where % <i e<axath 


zx z 
i] e** [ p'(x) — kp(a)) dx< ef e*t dy (6) 
t% 1% 
The integrand on the left-hand side may have a finite number of simple 
discontinuities but it has a continuous indefinite integral. Hence we may 
write 
1 xz 
[e** p(x), < « E Go «| 
or 


Pl) < eM plug) + = [e#e- — 1] (7) 
which is the required inequality. 
Case Il. If for all x, p(x) = 0, the theorem is obvious. 


Case Ill. If p(%) 4 0 where & is some fixed number 4» < F< %+h, 
but p(x) = 0 for some value of x, x) < x < #; since p(x) is continuous, 
there exists a number x,, %<2%,<%< 2% +A such that p(x) = 0, 
but p(x) 40, 7, <2<#. Applying Case I to the interval (x, 7) we 
have 


p(B) < eX#-®) p(x) + eee — 1] 


pee 8) 
which is an even stronger inequality than (4). 

Hence the inequality (4) holds in all cases, since, if p(x) < 0, the same 
results follow by considering | p(x)|. 


4. Uniqueness and Existence Theorems 


Theorem 3. If f(x,y) is continuous and satisfies a Lipschitz condition 
for y in a domain D, and if (2p, Yo) is in D and y(x) and 7x) are two exact 
solutions of y' = f(x,y) in an interval |x — x|<h such that y(%») = 
G(X) = Yo, then y(x) = Hx), |e — Xo] <h; i.e., there is at most one 
integral curve passing through any point of D. 


Proof. Applying Theorem 2, we have p(%) = ¥ — ¥ =0, and 
e=e,+¢,=0; hencep=y—y=0. 
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We may State this in the form that under the above assumptions, two 
integral curves cannot meet or intersect at any point of D. 

Observe that without the additional requirement of a Lipschitz con- 
dition uniqueness need not follow. For consider the differential equation 


_=y¥ (1) 


fa y)= y* is continuous at (0,0). But there are two solutions passing 
through (0, 0), namely 


(a) y=0 
@) : = (jx)? «x >0 (2) 
y=0 x<0 


Of course, ys does not satisfy the Lipschitz condition at y = 0. For, if 
y, = dandy, = — 6 
1 
= 5 (3) 


62/3 


KYs) — Sys) 
¥1— Ye 
which is unbounded for 6 arbitrarily small. 


Theorem 4. If f(x,y) is continuous and satisfies the Lipschitz condition 
for y in a domain D, then for (Xo, Yo) in D there exists an exact solution of 
y’ = f(x, y) for |x — | < h, where h is defined as in Theorem 1, such that 

Y(Xp) = Yo 
Proof. Given a monotone positive sequence {e,} approaching 0 as a 
limit, by Theorem 1 there exists a sequence {y,(x)} of functions satisfying 
Yn (%) = fl, Yn(X)) UP tO En, —-Yn(%0) = Yo 
over |% — %| <A; ice. 
[Yn (@) — fl, Yl@))| Sep, |B — MH] Sh (4) 
except for the finite set of points z,, i = 1,+ + -, m,. 
Part |. The sequence {y,(x)} converges uniformly over |x — %|<h 


to a (continuous) function (x). 
For let n and p be positive integers; and apply Theorem 2 to y,(2), 

Yn+ p(X) 

En + fntp 


[Yn(®) — Ynto(X)| < [ener =) (5) 


2 
en 1] 


whence the assertion. 
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Part Il. The sequence ({ Sa y,{2) a| approaches io St, y(t) dt 


uniformly for |z — x| <h. 
Let R be the rectangle |x — a)| <h, |y — yo| < MA, by hypothesis in 
D. Since f(x, y).is continuous in the closed domain R, given e« > 0, there 
exists 6 > 0 such that 


fay) -—fayl<6 ym), (@y)inR, |y—y|<d (©) 
Likewise for this 6 > 0 there exists N such that 
ly(z) — y(~)| <6, n>N, |x-al<h (7) 
in virtue of Part I. Hence ifn > N 
|f(z, y,(x)) — f(x, y@)|<<« n>N, |x-al<h (8) 


Therefore the sequence {f(2, y,(x))} converges uniformly to f(x, y(x)). 
Hence, by a well-known theorem we may reverse the order of integration 
and pass to the limit; which completes the proof of Part II. 


Part Ill. y(x) is differentiable, y(v) = yo, and 
y' (x) = fw, (x), |e — Bo] <h 


Integrating each side of (4) from xp to 2, we have 


[ Ee — fit, ¥,(0)] dt | <e,|% — %|<,h (9) 


But since y,(t) is continuous 
i) = 9 — | fle, (0) dt | < eh (10) 
Approaching the limit by Part II, we have 
yee) =m — | fle, yo) dt = 0 (11) 


from which the assertion follows by an elementary theorem on functions 
defined by definite integrals. 


Theorem 5. Let y(x) be an exact solution of y’ = f(x, y(x)) under the 
assumptions of Theorem 4, and j(x) an approximate solution up to the error 
€, for |x — X9| <h, such that G(x) = Yo. Then there exists a constant N 
independent of € such that |G(x) — y(x)| < €N, |x — x| <h. 


Proof. By Theorem 2 
eh — | 


k 


g-yl< 7 (e** — 1); ie, N= (12) 
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Theorem 5 justifies our description of the function of Theorem 1 as an 
“‘approximate solution,” for its actual value differs from that of the exact 
solution by a multiple of «; whereas in our definition we only required it 
to satisfy the differential equation to within an error e. To recapitulate 
our results: If f(x, y) is continuous in D and satisfies a Lipschitz condition 
with constant k, and (Xp, yo) is in D, then there exist quantities h and N 
such that 


(a) There is a unique exact solution y(z) of y'(x) = f(x, y(x)) passing 
through (%p, yp) and defined for |2 — 2| < A. 


(b) There may be constructed by the Cauchy-Euler method approxi- 
mate solutions #(x) such that, if « > 0 


l7(a—fag7a)|<<¢« |et—-al<h e748, 4 °° +8, 
|G(x) — y(x)| < Ne, la —a|<h (13) 
G(Xo) = y(X) = Yo 


Appendix to §4. Extension of the Existence Theorem 


We have seen that without the Lipschitz condition on f(z, y) thé 
solution need not be unique. The condition is, however, superfluous for 
the proof of existence of a solution. This may be proved directly; we 
give a proof based upon Theorem 4 by the use of Weierstrass’ polynomial 
approximation theorem. 


Theorem 6. If f(x, y) is continuous in D, there is a solution of y' = f(x, y) 
such that y(%») = Yo where (xo, Yo) lies in D, defined for |x —%|<h= 
min (a, b[M). 

Proof. We use the following two theorems of analysis: 


Lemma 1 (Weierstrass). If f(x,y) is continuous in the region R, given 
€ > 0, there exists a polynomial P(x, y) such that 


| P(2, y) — fia, y)| <6 (x, y) in R 
and 
P(x, y)<fley), @y)inR 

Definition 3. A set S of functions f(x) is equicontinuous over an 
interval (a, b] if given « > 0 there exists 6 > 0 such that if x1, x, in (a, b}, 
|x, — 2| < 6, then | f(x) — flx,)| < «, for all fin S. 

Lemma 2. An infinite set S of functions uniformly bounded and equi- 
continuous over a closed interval [a,b] contains a sequence converging 
uniformly over [a, b}. 
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By virtue. of Lemma |, let P,(x, y) be a sequence of polynomials such 
that 
[P,(2, y) — f(a, y)| <= &, |ar ar Xo| sS h, ly — Yo| < Mh 


where h and M are defined as usual, and the sequence {e,} approaches 0. 
Then {P,(x, y)} is uniformly bounded; and it may be assumed that M 
is the common bound of f and P,. Since P,(x, y) satisfies Lipschitz 
conditions in y, by Theorem 4 there exists a sequence of functions {y,,(x)} 
defined over |x — x| < A, satisfying y,'(x) = P,{2, y,(x)}, and such that 
Y,(%) = Yo. By Theorem 4 
[yn(x) = Yn(%o)| < Mh, |x 7 Xo| <h 
i.e., {y,(x)} is uniformly bounded over |x — xo| < h. 
Furthermore, {y,(x)} is equicontinuous over |x — 29| < h for 
Yn(X2) — Yul) = Yn (EN%2 — 4%), 2, SK E< He 
for all z,, 2, in [a,b]. But 
lyn'(2)| = |PalEs Yn(E))| <M 
Hence 
\Yn(%2) — Yn(%y)| < |*2 — 14|M 


Given ¢ > 0, if 6 = «/M, then if |x. — 2,| <6; 2%, 2 in |a — 2| <A, 
then |y,(a2) — Yn(s)| <e all m. Hence by Lemma 2, {y,(x)} has a 
subsequence converging uniformly over |z — 2% | << h to a (continuous) 
function y(x). It will do no harm if we denote the subsequence by the 
same symbol {y,(x)}. 

As in Theorem 4, Part II, we can prove that 


tim {fle ual) dt» | fle (0) a 


uniformly for |x — a <h. By definition 


y,(2) — Yo = [Pt y,(t)) de 
Hence we write 
ate) — vo —[ fe. va) at | <[ Pate vs(O) — Ae 4a) a 
But ifn > N 
|P.A2¥)—fayl|<en  |e*-—al<h, |ly—yl|< Mh 
. Hence forn > N 
Iya) — vo —[ “ft ¥a() ltt ooh 
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Letting m — 00 the above is valid with y,(x) replaced by y(x) and so 


y() — % — [ “Gyonal eee eae eh 


since e, tends to 0. As in Theorem 4, this last formula proves that y(x) 
actually is the required solution. 


5. Solutions Containing Parameters 


We consider next changes in the solution of a differential equation 
caused by either a small change in the initial conditions or a small variation 
in the function f(z, y). Our general results will be that such small changes 
will bring about only small changes in the solution. This again will be 
of interest to the physicist, since it will assure him that small errors in the 
statement of a problem cannot change the answer too greatly. 

We have seen already that the solution y(x) of 


y’ = f(x, y), Y(%) = Yo 
contains the initial value y, as parameter. We determine first a minimum 


range of values of x and ¥, for which we can be assured that y(, yo) will 
exist and be unique. 


Theorem 7. If f(x, y) is continuous and satisfies a Lipschitz condition on 
y in |x — %| <a, ly — G| < 4, (xo, Jo) some fixed point, there exists a 
unique solution y(x, Yo) of 

y =f),  y%) = Y 
in the region 
|r — al <hr’, Yo — Hol < 4/2, ly — Yol < Mh’ (1) 

where h' = min [a, (b/2M)], M being the bound of f(x, y) in |x — x| <a, 
ly — H| <5. 

Proof. The proof follows immediately upon applying Theorem 4 to 
the point (9, ¥9), |Yo — Go| < 5/2. Observe that the h’ of this theorem is 


not less than half of the A for which the original existence theorem was 
proved. 


Theorem 8. If in some domain D, f(x, y) is continuous and satisfies a 
Lipschitz condition on y for some k, and if a solution y(x, Yo) of 


y =f(x,y), YX) =Y% 


exists for some rectangle R, |x — | <h, |Yo — Go| <1, then y(x, yo) is 
continuous in x and yy simultaneously in R. 
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Proof. Consider the solutions y(x, y)'Y), y(x, yo), yo, and yp? 
satisfying |y“ — %|</,i= 1,2. Applying Theorem 2 to these two 
functions over |2 — x9| < h, we obtain 


[y(x, yo”) — y(& Yo)| < [yo — woe, = [ew — al <h (2) 

This means that in R, y(x, yo) is continuous in y uniformly in x. Since 
y(%, Yo) iS continuous in x for any value of Y, it follows by a general 
theorem of real variables that in R, y(x, yp) is continuous in x and y% 
simultaneously. 

Geometrically (2) means that two integral curves close enough together 
for one value of x stay close together for |x — 2| <h. 

With a little stronger restriction upon f(z, y) we now prove a further 
property of y(x, y)), namely: 


Theorem 9. Under the assumptions of Theorem 8 and if in addition 
(df/dy) (a, y) exists in D and is continuous in x and y simultaneously, then 
[Oy(X, Yo)]/OYy exists for (x, Yo) in R and is continuous in x and Yp 
simultaneously. 


Proof. Let % be an arbitrary value of yy which is to remain fixed until 
the end of the argument, satisfying |f) — %| </. Let yy be a variable 
in the same interval. For convenience we write 


Y(%, Jo) = G2), — Y(@ Yo) = Y(*) (3) 
We shall prove that the function 
(x) — ¥@) = 
P(x, ye) = * a, WFI (4) 
Yo — Yo 


approaches a limit as Yy—>¥Y, which limit must be [dy(x, ¥)]/Oy» at 
Yo = Yo; and that this limit has the required properties. 
In the first place we write 


d 
7, Me) — Ha] = fle, We) — fe, I) 
or by the mean-value theorem 
r) 
= [y(x) — 9(2)] E Sa, Fa) + d{y(@), we) (5) 


where as |y(x) — 9(x)| +0, d{y(x), H(x)} +0. Since by Theorem 8 
lim y(x) = 9(x) uniformly for z, la — x| <h 
Yo Io 
it follows that 
lim d{y(x), W(x)} = 0 (6) 
YoYo 
uniformly in x. 
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By (4) and (5) we may write 


Op(%, Yo) 
— <3. oo = ple, yo) E St, a) + S{y@), ie)}| ©) 
Since p #0 for ¥ #% by the general uniqueness theorem, we may 
divide (7) by p and integrate explicitly with respect to x 


pie, yo) = exp| | LY,(2, (0) + S{v), 9} az| (8) 


where the constant of integration is determined by the fact that p(», yy) = 
1. Equation (8) is valid for all y). But lim p(x, yo) exists; for by (6) 
and (8) Yoo 

= 90) _ oe 

UCB) — tim p(x, uo) = exp | fle, 902) a} ) 

OYo YoDo 7) 
The right-hand side of (9) is continuous in x and % simultaneously; for 
(Of/dy) (x, y) was continuous by hypothesis, and 9(x) = y(x, J) is con- 
tinuous by Theorem 8, 7%) now being considered the variable point. This 
completes the proof. 

By similar (but considerably more complicated) reasoning we can prove 
under the same assumptions that if f(z, y) in addition has a continuous 
first derivative with respect to some parameter y in a certain interval, the 
solution y(x) is also continuous and differentiable in wu. We omit the 
proof since this will appear as a special case of the general theory of 
Chapter 2. 

We finally prove that the solution is only slightly changed if /(z, y) is 
only slightly changed; precisely: 

Theorem 10. If in some domain D, 

(a) F(x, y) and f(x, y) are continuous, 

(b) f(x, y) satisfies a Lipschitz condition on y for some k, 

() |Fay—faynl<e (yin D, (10) 

(a) for some point (2p, Yo) in D, y(x) and 7x) are admissible in |x — x9| < 
h and satisfy 


y (2) = f(x, Y), 7] (x) = F (x, 9), y(%p) = G(X) = % 
then 


ya) — 7@)| < ; (**—1), |x—a|<h (11) 


Proof. By (10), #(x) is an approximate solution of y’ = f(x,y) with 
error €; applying Theorem 5 we immediately obtain (11). - 
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For example, in the neighborhood of the origin we can replace y’ = 
sin (xy), which cannot be integrated explicitly, by y’ = zy, which can; and 
for sufficiently small values of x and y the error will be arbitrarily small. 


PART B. 
CONTINUATION OF SOLUTIONS. OTHER METHODS 


6. Continuation of Solutions 


In Part A we proved essentially the following: If in a domain D the 
function f(x, y) is continuous and satisfies a Lipschitz condition in y 
uniformly, for every point (7p, ¥) of D there is a rectangle Ry such that 
the integral curve y(x) of y’ = f(x,y) passing through (2%, yp) can be 
extended at least to the right and left sides of Ry. Since Ry lies in D, by 
applying Theorem 4 to the point at which the integral curve goes out of 
Ry, we can extend the region in which the curve is defined. We now 
prove that if D is bounded, the integral curve passing through any point 
of D may be extended up to the boundary of D. 

We remark first that since D is bounded, and the integral curve cannot 
pass out of D, there exist positive numbers / and m, such that the integral 
curve passing through (2p, ¥p) can be defined in the open interval 


X—l<u<4X4+m (1) 


but not outside it. Our theorem will demonstrate that as x +2) + m 
the integral curve approaches the boundary of D; a similar proof holds 
for approach on the left. 


Theorem 11. Let D be an arbitrary bounded domain, and let f(x, y) be 
continuous in D and satisfy a Lipschitz condition in y locally in D; i.e., 
for every point of D there must be a neighborhood N in which the condition 
is satisfied uniformly. [For example, it is sufficient that (0/oy) f(x, y) 
exist and be continuous in D.| Let the solution y(x) of y' = f(a, y) passing 
through (Xo, Yo) be defined on the right only for 


Lyar<%+m (1’) 
Then if p(x) is the distance of the point P, (x, y(x)) from the boundary C of D 
lim p(x) =0 (2) 

L—->FqgtM 


Proof. Let « > 0 and let S be the region in D consisting of all points of 
D having a distance greater than « from C. Suppose the solution 
(x, y(x)) has points in S as x-> 2) +m. Then there exists a monotone 
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increasing sequence {z,}, 7 = 1, 2,- - -, such that 2,» 2» + m and the 
points P;, (x;, y(a,)) are allin S. Since S is bounded, the points P, have 
at least one cluster point P, (&, 9), which is in S or on its boundary and 
hence is inside of D. Since 2; 2% + m, X = 2% +m so that 1; < Zz 
for all 7. 

Since P is in S, P is the center of a rectangle R which is in D and with 
edges x= X+a, y=9¥+5 for some positive a and b. Let M bea 
bound on |/| in R. Then by decreasing a if necessary we can assume 
that a < b/(4M). 

Let R, be the rectangle with edges a = 4 —a,x=Z,andy=9% + 3b. 
All points P; must be in R, for j large enough. In particular, then, for 
some n, P,,, (a, y(X,)) is in R,. Hence from Theorem 4 it follows that the 
solution of y’ = f(z, y) through P,, must exist to the right of x, and stay 
_ in R. Indeed for the case here, h of Theorem 4 is given by 


h = min ( + a — x,, $b/M) 


and since + a — x, S 2aand 4b/M > 2a, it follows thath = + a—~2,. 
Hence the solution y(x) exists for z, <x S%+a. Since = 2 +m, 
this contradicts the fact that the solution y(z) is in D only for 7% —1< 
x<2%y+m. Hence as x —> 2, + m, the solution can have no points in 
S. Since ¢ is arbitrary, this proves the theorem. 


y y =(2/3x)?? 


y=0 


Fig. 2 


This does not necessarily mean that the integral curves will actually 
approach some particular point of C; they might oscillate over a finite 
interval in the neighborhood of the boundary. Also, a number of 
integral curves may approach the same point of the boundary; e.g., if 
D is any bounded domain such that in D, x > 0, and f(a, y) = y*, then 
any integral curve lying between y = ($x)? and y = 0 will approach the 
origin on the left (Fig. 2). 
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If D is unbounded, Theorem 11 does not hold as stated; however, we 
may replace it by a corollary: 


Corollary. If D is unbounded, under the other conditions of Theorem 11, 
as X —> Xy + m either 


(a) y(x) becomes unbounded, or 
(b) y(x) actually approaches the boundary of D. 


For if y(x) does not become unbounded, we may replace D by a suitable 
bounded domain, and Theorem 11 is applicable. 

Under certain conditions we can go farther and state that, even though 
D is unbounded, the integral curves will approach a real boundary. For 
example: 


Theorem 12. Let f(x,y) be continuous in D; %<x“x<%, -O< 
y < ©, and satisfy a Lipschitz condition on y uniformly in D. Then the 
integral curve y(x) passing through any point (Xp, Yo) of D is defined over the 
whole open interval x, << 4% < ,. 


Proof. Suppose there is a point %, x) << @ < a, beyond which y(z) 
cannot be continued. Since the Lipschitz condition is satisfied uniformly 
for some k, we may apply Theorem 2 to the approximate solution 
U(x) = Yo Over the interval 4< 24< #—e, for any e>0. The error 
of 7(x) is 
dy : 
Ig OD 


max = max|f(z,y)|,  m<2<F 6) 


This maximum is finite, since f(x, yo) is continuous over the closed interval 
LSxeXlE<-24,; letit be M. Then by Theorem 2 


lA 


M a 
ly) — wel < see — 1, my<ece—e (4) 
Hence y(x) is bounded in the neighborhood of x = 2%. Therefore by 
the Corollary to Theorem 11, (x, y(x)) must approach the boundary of 
Dasx-»>Z. This is manifestly impossible, and therefore y(x) must be 
defined forz)< a< 2. Similarly, y(x) must be defined for x, << x < Zp. 
Hence the theorem. 
For example, a solution of the equation 

d 

ae P(x) cos y + Q(x) siny 

dx 
{where P(x) and Q(x) are polynomials] passing through any given point 
must be defined for all values of x. 
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Now that we.have shown the existence and uniqueness of solutions in 
very general domains, we prove that the Cauchy-Euler method may be 
used to approximate an exact solution arbitrarily closely between any two 
values of x for which the solution is defined. 


Theorem 13. Under the hypotheses of Theorem 11, let y(x) be an exact 
solution of y' = f(x,y) in some domain D, passing through (%p, Yo) and 
defined for x = x,. Then the value of y(x,) may be determined by the 
Cauchy-Euler method to any degree of accuracy by a sufficiently fine sub- 
division of the interval xy< 4% < 2. 


Proof. The boundary C of D and the curve y(x), %< 2x < 2%, have 
a nonzero distance d, being both closed, disjoint, bounded sets. Let 7 be 
any number,0<7<d. The strip S,a<x< 4%, ly — y(a)| <7, lies 
entirely in D. Hence there is some value k of the Lipschitz constant for 
J, y), valid throughout S. We prove that if by the Cauchy-Euler 
method we begin to construct an approximate solution 7(z) at (%%p, Yo), 
satisfying the differential equation with an error e€ 


kn 
ve ela—%el . | ©) 


then (x) will be defined over the whole interval 7<2<2%,, and 
Fa) — y(x)| < 7. 

We know by Theorem 2 that 9(x) will certainly be defined in some 
interval to the right of 2, say << «<#. Then by Theorem 5 we can 
compute the actual error; it is 

ee _ eflf—ml _ 1 
V2) — ¥@)| <n al 1 (6) 


Hence if # < x,, |(@) — y(@)| <n; and therefore (x) still is in the 
strip S. As long as 7(x) stays in S, we can continue the approximating 
process, since in S, f(x, y) satisfies the conditions of Theorem 2. Hence 
jx) is defined up to x = 2,, and from (6) |9(z,) — y(x,)| <1; whence 
the theorem. 


7. Other Methods of Solution 


Another method of proving the local existence of a solution of y’ = 
J(«, y) is Picard’s method of successive approximations. The idea behind 
this method is as follows. Suppose y = (x) is an approximate solution 
of 

d 


S=fey, yer) = | (l) 
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with the initial condition $(%) = y. Let us replace the unknown 
function y in the right-hand side of (1) by the function ¢(x). This yields 
the differential equation 


d 
oS = fla, He), yl) = Yo Q) 


which can be solved by a quadrature. It seems very plausible that the 
solution of (2) will better approximate the exact solution of (1) than will 
g(x). Picard’s existence proof consists of repeating this process to form 
an infinite sequence of approximate solutions which are then shown to 
converge to an exact solution. The process also produces approximate 
solutions of any required degree of accuracy. Observe the analogy 
between this method and Newton’s method of approximating solutions 
of algebraic equations. 

It will be more convenient to transform (1) into an integral equation; 
we state formally: 


Lemma 1. Let f(x, y) be continuous over a domain D, and let y(x) be any 
continuous function defined over some interval x, << x < 2%, and admissible 
[i.e., such that (a, y(x)) is in D]. Let x be any point such that x1 < % < iz. 
Then a necessary and sufficient condition that y(x) be a solution of the 
differential equation (1) is that it be a solution of the integral equation 


, 


y(t) = Yo + [ fey dt, m<r<% (3) 


The proof is trivial; we have already used the lemma implicitly. 

We shall demonstrate the existence of the solution of (3) by Picard’s 
method under the same assumptions as in Theorem 4. This theorem is 
closely related to the so-called fixed point theorems of topology, which 
state that given a continuous transformation of an abstract space S into 
itself, under certain conditions, there must exist points which remain 
unchanged under the transformation. Now we may take as the “points” 
of S a set of continuous functions ¢ over 7, < 4< 2,. If we consider - 
the transformation L(¢) 


L($) = Yo + [ At, $0) dt (4) 


it can be shown that S and L satisfy the conditions of a certain very general 
fixed-point theorem, and hence that there exists at least one “point” ¢ 
for which L(¢) = 4, i.e., for which (3) holds. 

We prove our theorem, however, by more elementary and constructive 
means. 
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Lemma 2. Let R: |x:— x| <h, |y — yo| << Mh be the rectangle of 
Theorem 2 in which f(x, y) is continuous and bounded by M. Let $,(x) be 
an arbitrary function with continuous first derivative and admissible in R 
[e.g., set P(X) = yp]. Define $,(x) by 


$(2) = y+ [ “MedlO)d,  je—al<h (5) 


Then $,(x) is admissible and $,(%) = Yo. 
Proof. Obviously 


Idx(2) — vol < i LA, dy(t))| dt < max |flt dot) | — 2 
< Mh, |x — x%|<h (6) 
Hence ¢,(x) is admissible. 


By repeated applications of the lemma we may define an infinite sequence 
of admissible functions by the recursion formula 


bn4i(2) ie Yo +{" Kt ¢,(t) dt, r= 0, l, 2, m8 fy |x a Xo| <= h 


. (7 
We now state our main result: 7) 


Theorem 14. If in addition to the assumption of Lemma 2, f(x,y) 
satisfies a Lipschitz condition on y in R for some constant k, then the 
sequence {f,(x)} converges uniformly and absolutely to a function satisfying 
the integral equation (3) and hence the differential equation (1). 


Proof. Let D be the error with which ¢,(z) satisfies (1) in |z — x | <A 


mi 


—fa,d())|<D, |zx-—al<h (8) 


Integrating (8) from 2 to x, we obtain 


Idol) — $4(2)| = |bol2) — Yo — [ ” Ate dy(0)) dt| < Diz —a| ©) 


We wish to prove 


ke! D ae n 
\ba(x) ae bn—1(2)| < eel 


|z — %|<h (10) 


We give a proof by induction for x = 2). Equation (10) is certainly true 
form =1. Assume (10) for =m. By the definition of ¢,(z) 


Smear) — bq(2)| < i‘ Lft b(t) — ft $y_a())|- at 
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and by the Lipschitz condition 


<k | lba(®) — bua] (11) 
Since we have assumed (10) true for n = m, we have 
bnsa(®) — bol] <P [Ce — soa (12) 
whence 
bua) — dq(ay| < OE A (13) 


This being (10) written for n = m + 1, the proof by induction is complete. 
Equation (10) may now be written 


kr-l Dh" 
n! 


, lem <h (14) 


\bn(x) i bn-1()| = 
This proves that the sequence {¢,(x)}, or equivalently the series 
be) + YF Gy) — boa) (15) 


converges uniformly to a continuous function ¢(x). But in (7) we may 
pass to the limit and interchange the limit process and integration process, 
as we have done several times before. Hence 


$2) = lim fy(2) = yo + lim | fl, (0) dt 


= vt] M0, 40) ae (16) 


This completes the proof. 

Observe that the existence of a solution has been proved only for the 
interval |w — 29|<h. Thus Picard’s method (unlike the Cauchy-Euler 
method) is suitable for finding approximate solutions only in a small 
interval about z). But in such an interval it is very convenient. The 
error after the nth approximation may be easily computed; it is 


IP() _ $n(X)| Ss y \b,41(2) = $,(x)| <= z > Eny: 


1 v! 


oe Fey an 


~~ k nt >> y! k 
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Finally we remark that the Cauchy-Euler method may be generalized 
by using more refined types of approximation to the exact solution. 
For example, the method of Adams and Stirmer uses parabolic arcs 
whose extra constants are so chosen as to fit more closely the part of the 
curve already constructed. Runge’s method uses line segments whose 
slope is determined by the value of /(z, y) at several points, using an 
interpolation formula. These methods supplant our crude ones in the 
solution of any practical problem. 


Systems of 
Differential Equations 


1. Introduction. Vector Notation 


We now consider a system of differential equations of the form: 


dx. 

are = f(%, Xq,° * °y Uys t) 

dz. 

— = f(%, X25 es ens t) 

dt 

(1) 

dx, 

“dt = f,(%, Ho,° * *y Uy; t) 


where fi, fo.’ * *, fy are single-valued functions continuous in a certain 
domain of their arguments, and 2, %,- - °, 2, are unknown functions of 
the real variable ¢. A solution of (1) is a set of functions 2,(t), 7,(t),° °°, 
x,(t) satisfying (1) for some interval t, << t < tg. 

To treat the system (1) it is convenient to introduce vector notation. 
An n-dimensional vector X is defined as an ordered set of n real numbers, 
2X4, T,° * *, Lp, Called the components of X, and X is designated by 


X= (X, Xe," * *, Ly) 
23 
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We shall always use capital letters for vectors and small letters for real 
variables (scalars). We assume that the reader is familiar with the 
elements of vector theory. By |X| we denote the norm of the vector XY 


|X| = (424 a2+-- ++ a,2)t 
If X is a function of a real variable t 


X(t) = {21(0),* + +s tA(0)} 
it is continuous in t if and only if each of its components is continuous in 
t. If each of its components is differentiable, X(t) has the derivative 


dX(t) feet deft) aro) 
dt \ dt’ dt dt 


dt’ dt’ °° at 
If X is a function of the components y,, ¥2,° * *, y, of another vector Y, 
it is continuous in Y if and only if it is continuous in the n variables y,, 
Yo," * *, Yn Simultaneously. 

Let us therefore in terms of the system (1) define the vectors 


X = (4%, %,° °°; Ln)s F(X, t) = {fir So» — ita} (2) 


where by assumption F is continuous in X and ¢ simultaneously. Then 
the system (1) may be written as the single vector equation 


dX 
a 7 GD (3) 


The vector equation (3) is obviously analogous to the scalar equation of 
Chapter 1 


© = fe) a) 


and is in fact equivalent to it when n = 1. We shall discover that all the 
theorems of Chapter 1 may be generalized so as to hold for the vector 
equation (3). 

We shall also at times consider the set of m numbers (2%, 22,° + °, %,) 
as defining a point in n-dimensional Euclidean space. Then of course 
every such point defines a vector and vice versa. We shall find it con- 
venient to use the two ideas of vector and point interchangeably; e.g., we 
shall define the range in which the vector X may vary by saying that the 
point (2, %,° ° -, x,) (or even “the point X”’) may vary through a certain 
region R of n-dimensional space. We shall use vector notation wherever 
algebraic processes are involved. This double terminology will lead to 
no confusion. 

In §§ 2-5 of this chapter the theory will closely parallel that of Chapter 
1, so we shall give detailed proofs only where there is a real difference from 
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the simpler case. In these sections the theorems are numbered so as to 
correspond with Chapter 1. 


2. Approximate Solutions 
Let Xp = (X19; Yap, * * *» %yo) and ft, be fixed. Let F(X, 1t) be con- 
tinuous in the (7 + 1)-dimensional region R 
IX-X|<b, |t—al<a (1) 


Since F(X, t) is continuous, |F| is bounded by some number M in R. 


Then as before we define an approximate solution of 
dX 
—=F(X,t 
— = F(X,1) Q) 


with error € aS a continuous admissible function X(t) with piecewise 
continuous derivative, such that 


X(t) = Xp (3) 
and satisfying 
|X’) — F(X), D| < (4) 


for all points of some interval at which the left-hand side is defined. As 
before, we have immediately: 


Theorem 1. Given «> 0, there can be constructed an approximate 
Solution of (2) with error € over the interval 


|t — t| <A = min (2, a) (5) 


For exactly as before, there exists 6 > 0 such that 
|F(X,, ty) ae F(X), ts)| <e€ 


for (Xj, ty), (Xe, te) in R, |X, — X2| < 4, |t, — t.| <6. Choose numbers 
t;, i = 1, 2,° + +,m— 1 such that 


ly<t<te<s''<th=tyth 
é\. 
|t; — t;_4| << min (s, “), i=1,2,--¢,m 
Then the approximate solution will be given by the recursion formula 


X(t) = X44 + ( — 6D F(Xp ta) 


for t,,;<t<1t,, where X;, = X(t,). 
The details of the proof proceed as before. 
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3. The Lipschitz Conditions 


We shall next prove the fundamental inequality. The necessary 
extension of the Lipschitz condition is given by the following definitions 
and lemmas. 


Definition 1. A continuous scalar function f(X,t) defined in some 
(n + 1)-dimensional region R is said to satisfy a Lipschitz condition on X 


for k if 
AX, 1) — f(Xp D| < kM — %| (1) 


for (X, t), (Xa t) in R. 

Lemma 1. Let R be convex in the components x1, %,° + *,%, of X; 
i.e., let the n-dimensional region defined by (X, t) in R, t = to, be convex 
for each ty. Then if each of the partial derivatives [of(X, t)\/ox, i= 1, 
2,° ° *,m, exists and is bounded by N in R, f(X, t) satisfies a Lipschitz 
condition on. X in R with constant k = nN. 

Lemma 2. Let R be imbedded in a larger region D and let every point 
of R have a distance from the boundary of D = 6. Then if f(X, t) and 
[Of(X, t)|/Ox, i= 1, 2,- + +,n, are bounded by M and N respectively’ in 
D, f(X%, t) satisfies a Lipschitz condition on X in R for 


k = max (= nv) (2) 


Proofs. These are analogues of the lemmas of Chapter 1, § 3, the only 
difference being in the value of k. By the mean-value theorem for 
several variables 


[f(a Var* *° *y y; t) — f(xy, Lei, BI Sy La t)| 
< $e 


(a; — «;'/) dn, 
the partial derivative being evaluated at some point of R (for Lemma 1) 
or of D (for Lemma 2). But 


< N >> |a, aad x, | (3) 


n 
2 |u, — 2,'|<nl[X— X’| (4) 


whence the proofs proceed as before. 


Definition 2. A vector function F(X, t) is said to satisfy a 1 Lipschitz 
condition on X for constant k if 


|F(X,, t) — F(Xy, 2)| < kl X, — X . (5) 
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Lemma 3.- A vector function F(X, t) satisfies a Lipschitz condition on 
X if and only if each of its components f,(X, t) does (where the constants 
may be different). 


Proof. (1) If k holds for F(X, t), then 
F(X 1) — f(X D| < [FX 1) — F(X | Sk — %| =O 
(2) If k holds for each f,, then 


n t 
| F(X, ty —_ F(X), t)| = [ Sn t) — F(X, oy | 


< pared 1) — f(Xn D|<nk|IX— % 


Finally we shall need the following: 


Lemma 4. If a vector function X(t) is differentiable over some interval 
ty <t < tq, then wherever |X(t)| # 0, [d| X(t)|]/dt exists and 


d| X(t)| ae 
| dt | dt 8) 
n 4 
Proof. If | X(t)| #0, since | X(s)| = b m0) , it follows that 
i=1 
" dx; 
| X(t)| _ >> a at 
dts X()| 
Furthermore, in any case 
[X(t + Add| — |X| X(t + At) — X(t) 
At = At 0) 


and from the existence of both derivatives it is clear that the same in- 
equality will hold in the limit. Hence the lemma. 


4. The Fundamental Inequality 


We can now state the fundamental inequality for the vector equation 
(2) of § 2: 

Theorem 2. Let F(X,t) be continuous in some (n + 1)-dimensional 
region R and satisfy a Lipschitz condition on X with constant k. Let 


X,(t) and X,{t) be two approximate solutions of (2) of § 2 over some interval 
|t — to| <b with errors €, and «, respectively. Set 


Pi) = X,t)—-X,); po=|PO|; «=ate (1) 
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Then 


| p(t)| < elt“! | n(t,)] + ce —1) (2) 
for |t — | <b. 
Proof. Wherever P’(t) exists and P(t) 4 0, by Lemma 4 p’(t) exists and 
POS |P'O| 
By the definition of approximate solution 
IP'(t)| = [X41 — X'(0| < |FXGWO, D) — F(X, D| + € 
and by the Lipschitz condition on F 
|P’O| < k| XO — X,()| + « = kplt) + « 
whence 4 fortiori we have 
P(t)<kpt) + € (3) 
From here on the proof follows exactly as in Chapter 1, since we need to 
integrate (3) only over intervals in which p(t) 4 0, and therefore p’(t) is 


defined except for a finite number of points. There is the additional 
simplification that in the present case p(t) is nonnegative. 


5. Existence and Properties of Solutions of the System 


From the fundamental inequality our basic results follow exactly as in 
Chapter 1. We omit the proofs of the following theorems; they differ 
only formally from those of the simpler case. 


Theorem 3 (Uniqueness). Let F(X, t) be continuous and satisfy a Lipschitz 
condition on X in some neighborhood of the fixed point (Xo, to). Then 
there can exist at most one solution of 


dX 
— = F(X, 1 
— = F(X,1) (1) 
such that X(ty) = Xp. 
Theorem 4 (Existence). Let F(X, t) be continuous and satisfy a Lipschitz 
condition on X in the (n + 1)-dimensional region R 


|x-X)<4 |t—4|<a (2) 


Let M be the upper bound of |F| in R. Then there exists a solution X(t) 
of (1) defined over the interval 


|t — |< h = min (2, 4) (3) 


(We omit the analogues of Theorems 5 and 6.) 
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Theorem 7. Let F(X, t) satisfy the conditions of Theorem 4. Then 
there exists a region S 
|\X¥-X|<a’ <a, |t—t|<b'<b (4) 
with the following property; if X, is a vector such that |X) — X,| <a’, 
then there exists a unique solution X(t) of (1) with the initial value X(t) = Xq, 
defined for |t — t)| <b’. 


Theorem 8. Let S be any region with the properties of Theorem 7. 
Then the solution X(t) of (1), considered as a function of its initial value 
X, = X(to), is continuous in Xq and t simultaneously, for (Xo, t) in S. 


The proof of the differentiability of the solutions with respect to their 
initial values is more difficult; we give it in full. 


Theorem 9. Let F(X, t) satisfy the conditions of Theorem 4 for R 
IX-Xl<a, |t-t| <b (5) 


F(X, 1) = {A(% 9), A(X, ° + 5 f(X, 9} 
X(Xy t) = {x4(Xo, t), LX, t), rer og X(Xo, t)} (6) 


Xo = {X19 Tap, ° * » Lo} 


Write 


r) 
where pest) = F(X, t), X(t) = Xp. Let —f(X,); ij =1,2,- ++, 0, 
dt Ox; 


exist and be continuous in X and t simultaneously, (X, t) in R. Then 
Ox,( Xo, t). 
Oj” 
simultaneously, (Xo, t) in S, where S is the region of Theorem 7. Hence 
OX(X, t) 
OX jo 


i, j= 1, 2,° + +,n, exist and are continuous in X, and t 


; j= 1,2,° + +, 2, exist and have the same properties. 


0x,( Xo, t) 
O29 

properties. Let X, be an arbitrary fixed vector such that (Xp, fo) is in S. 

Let Az, be some number sufficiently small so that the solutions (,, 

%q,* * *, L,) and (a + Ax, x, + Azz, - +, x, + Az,) of (1) with initial 

values (249, Log, ° * *» Lao) ANA (Tyo + AXyo, Loq,° * *, Xn) are defined for 

|t — t)| <b’. This may always be done. Set 


Azt) 
Ax,’ 


Proof. It suffices to prove that exists and has the required 


P&Ax, t) = =1,2,-- +n (7) 


where obviously 


l, 
(AX; to) = 
P&AX4o, bo) 0, ix1 
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We shall prove that 
lim p(Azyp, ¢), i=1,2,--+Vn (8) 
Az +0 


exists uniformly for (Xo, t) in S. By Theorem 8, z,(t) [hence p,(Azo, ¢)] 
is continuous in 249, Zo9,° * *,XZno, ¢ simultaneously for (Xp, ¢) in S. 
Therefore by virtue of the uniform convergence of (8) it will follow that 
Ox (Xo, t) _ 

Wy 


lim p(Az%p, t) 
At%o->O 


has the required properties. 
By applying the fundamental inequality to the two solutions (7, 
Xq* * *, Z,) and (x, + Az,, %, + Axs,* - +, 2%, + Az,), we obtain 


V Az? + Ave +--- + Ax,2<e*lAx,|, |t—t/<b' (9) 
By the theorem of the mean in n variables, we may write 


dAx{t 
PAD fey + Mitty + Dit + +5 y+ Btys 1 
— fl®y) Uy, * * + Lys te 
= Sellen Me + 5 th) Bay + 140 (10) 
where 
lim 20 __ = 0 (11) 
A290 LS Axs 
k=1 


uniformly for (Xo, t) in S. Then by (7) and (10) we may write 


bike!) t) -3=5 oo —— faq) Lyy* * qi OP, + F(A, 1) (12) 
where 
t= ee (13) 
But by (9) 


f= | === | | = (14) 
‘| An . J 2 Aa 


Hence by (11) 
lim |£{s)| =0 (15) 
AX -+0 ; 


uniformly for (Xo, t) in S. 
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Now consider the system of differential equations in m unknowns 
qI,i= 12,5 “yn 


dq; _ 2% 0 : 
dt a >> On, PACAGS £,(t), , X(t); 1)4y (16) 
with initial conditions 
(4) 1, i=l 
(fo) = 
een lG. Weill 


The right-hand side of (16) is obviously continuous in the qg’s and is 
continuous in ¢ by definition. By § 3, Lemma |, it satisfies a Lipschitz 
condition on the q’s. Hence (16) satisfies the hypotheses of Theorem 4. 
Now the system (p,, Po,* * *, Pp) satisfies (16) with an error 


« =nmax |C,(0), i=1,2,-°-+,n, (X,, t) in S (17) 
Hence by the proof of Theorem 5 (as in Chapter 1), lim p; = q;; i= 1, 
e—0 


2,° ° +, , uniformly for (Xo, t)in S. Therefore by (15) and (17), lim p(t) 
exists, uniformly for (Xo, t) in S. This completes the proof. 4%»*° 

From Theorem 9 we can immediately deduce the following theorem 
on continuity and differentiability of the solution with respect to 
parameters: 


‘Theorem 10. Consider a system of differential equations in which the 
functions f; depend upon any number of parameters (y,° * *, bm 


dx . 
= = f(x, Xo, By SO Xn Ay He» Rats Ln; t), i= 1, 2, 7c yn (18) 


If each of the f’s has partial derivatives with respect to 2, %,° * *, Xq} 
Hy» Pos’ * *s Mm continuous in some (n + m + 1)-dimensional region R, 
then the solutions 


Xy(Xyq, ®oor° * > Xn9s Mis Her* * *s hms t), i=1,2,°°*-+,n (19) 


will have partial derivatives in [y, Me,* * *; bm continuous in all their 
arguments through whatever part of R the solutions (19) are defined. 


Proof. Regard the parameters fy, fe,° * *, 4m aS new variables, and 
adjoin to (18) the additional equations 


las oY J=1,2,-°+,m (20) 


Then all the conditions of Theorem 9 are met by the system (18) + (20); 
hence the solutions x, have the required properties with respect to the 
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“initial values” of the w’s; since the yw’s are taken as constants with 
respect to ¢ the result follows. 

We shall not need the analogues of Theorems 10 and 13 of Chapter 1. 
We state the two following theorems again without proof: 


Theorem 11. If in some bounded (n + 1)-dimensional domain R, F(X, t) 
is continuous and locally satisfies a Lipschitz condition in X, the solution of 


dX 
= = GD (21) 


passing through any point of R may be uniquely extended arbitrarily close 
to the boundary of R. 


Theorem 12. If R is the domain 
Latte X unrestricted 


and if F(X, t) satisfies a Lipschitz condition uniformly in every subdomain 
of the type 

h<t'<t<t’<t, all X 
then the solution of (21) passing through any point of R may be extended 
throughout the entire open interval ty < t < tg. 


This is an immediate consequence of the fact that the solution can be 
extended throughout any subinterval of t; < t < 4. 

As an example,.take the extremely important case of a linear system; 
ie., a system of the form 


Hs = Safest), F=12-+4m —— Q2) 
dt j=l 
where a,,(t) and 5,(t) are continuous in some open interval t, < t < fg. 
By § 3, Lemma 1, (22) satisfies the conditions of Theorem 12; hence the 
values of the x’s at any point f) (where t; < ft < f,) determine a solution 
(of course uniquely) throughout the whole open interval ty <t <t,. The 
study of such systems will occupy the following chapters. 
Finally we remark that Picard’s method may be applied to the general 
vector equation; the generalization of the process of Chapter | is perfectly 
straightforward. We shall see an application of this in the next chapter. 


6. Systems of Higher Order 


Consider the differential equation of nth order in one unknown 
d"x . 
a = f(a, + +, 2, x; t) (1) 
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This equation may always be replaced by a system of differential equations 
of the first order. To do this, we take x and its first (7 — 1) derivatives 
as new unknown functions x,, 2,° - -,,, Which must obviously satisfy 
the relations 


dx 
“ao 
dXLn-4 
n _ y) 


dx. 
Lew * *, Xe, Ly; t) 


It is easily seen that (1) and (2) are equivalent, ie., that any solution of 

(2) defines a solution of (1) and vice versa. It is also clear by the existence 

and uniqueness theorems that wherever / satisfies a Lipschitz condition 

on its first m arguments, the solutions of (1) will exist and be uniquely 

defined by the value of x and its first (n — 1) derivatives at any point fp. 

All our other theorems admit of obvious generalizations to the equation (1). 
If in particular f is linear in the 2’s, i.e., if 


‘ n 


S Gn Gets 65% O= 2, at)x,; + b(t) (3) 
it is clear that (2) will satisfy the conditions of Theorem 12, hence that 
the solutions of (1) may be extended throughout any interval in which the 
a’s and b’s are continuous. 

Finally, any system of differential equations of higher order in several 
unknowns may be. reduced to a system of the first order, provided only 
the system is explicitly solved for the highest derivative appearing of each 
unknown function 

d™z, 
an 


where k = 0, 1,- + +,m;—1; i, 7 =1,2,°-°+,n. It is easily seen that 


Silas, t) (4) 


n 
(4) is reducible to an equivalent first-order system of N = > m, equations, 
i=1 


and that all our theorems likewise hold for such a system. 


Linear Systems of 
Differential Equations 


PART A. 
THE GENERAL LINEAR SYSTEM 


1. Introduction. Matrix Notation 


We consider in this chapter /inear systems of the type 


os = > a,(t)x; + 6,2), i=1,2,°--,n (1) 
at j=1 

where a,,(t), 5,¢) are continuous functions of t over some interval 
tl, <t<t,. We have already seen (Chapter 2, Theorem 12) that if t 
is any number, t, < ft < ta, then corresponding to any set of numbers 
X19, Xap, ° * *, Lng there exists a unique solution of (1) satisfying the initial 
conditions 

X(to) = Xo» i=i,2,- +n (2) 


uniquely defined over the whole open interval t, < t < tg. 

If in (1) the functions b(t) are all identically 0, the system is said to be 
homogeneous. In the homogeneous case, the system (1) is completely 
determined by the matrix 


34 
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QA, Gp °° * Ain 


4m 4° °° Gan 


1 ne ® 


Q@ny Ang °° * Gan 
where we write W(t) to indicate that each of the elements of the matrix 
(3) is a function of ¢. Now corresponding to any n-dimensional vector 
X = (a, %,° * *, Z,) and any such matrix & (where X and W may both 
be functions of t), we define the /inear transform of X by as the vector 


Y = (Y, Yoo" * ‘+ Yn) 
whose components y, are given by 


n 


Yi, = > a,j, i=1,2,--°,7 (4) 


j=1 
Symbolically we may write the relation between Y and X as 
Y= AX (4’/) 


With this notation, the homogeneous case of (1) may be written as the 
yector equation 


dX 
= MOX (5) 


Consider now the totality of solutions X(t) of (5). 


Theorem 1. The set of solutions X(t) of (5) has the properties: 

(a) X(t) = 0 is always a solution (the “‘trival solution’’). 

(b) If X(t) = 0, ty some number, ty < ty < te, then X(t) =0, < 
t<t,. [For X(t) = 0 isa solution and we know that the solution is unique.] 

(c) If X(t) is a solution, then cX(t) is a solution, c any constant. If 
X,(t) and X,(t) are solutions, then X,(t) + X,(t) is a solution. Hence the 
solutions of (5) form a linear manifold; i.e., any linear combination of 
solutions is itself a solution. 


2. Linear Dependence. Fundamental Systems 


We recall that m n-dimensional vectors X,, Xo,° - *, Xj, are said to be 
linearly independent if the identity 
m 
> X; = 0 (1) 


i=1 
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c,; constant, implies c; = 0, i = 1, 2,: - -,m. Otherwise the vectors are 
said to be linearly dependent. It is easy to see that if the vectors Xj, 
X,,° * *, X, are linearly dependent then at least one of them (say X,,) 
can be written as a linear combination of the others 
m—1 
Xn= DY X, (2) 
i=1 
We assume the following facts as known. No (n + 1) n-dimensional 
vectors can be linearly independent. There exist sets of n linearly in- 
dependent vectors; e.g., the unit vectors 


£,=(1,0,- + +,0), 2, =(,1,-°°,0),: °°, £,=(,0,--+1) @Q) 


If X,, Xo,: ° +, X, is a linearly independent set of m vectors, then any 
vector can be written as a linear combination of the X’s; but this is true 
for no set of k <n vectors. A set of n vectors 


Xx; = (X35 Via, - Lin)s i= 1, 2; 7 yn 


is linearly independent if and only if the determinant |z,,| is different 
from 0. Any nonsingular linear transformation of a linearly independent 
set itself defines a linearly independent set. 

A set of functions f(t), i= 1, 2,°--+,m, is said to be linearly 
independent over the interval t, < t < t, if the identity 


m 


2 ef{(y)y=0, th<t<t (4) 
i= 
implies c; = 0, i= 1, 2,- + +,m. In the case of functions whose values 
are vectors, we define linear independence in an analogous way. The ¢; 
in this section always denote constants. 
Returning now to the homogeneous linear system of § 1 

dX 

Pam A(t) X, h<t<t (5) 
we prove: 


Theorem 2. Let ty be any number, t, < ty < ta, and let X,(t), X,(t), 
- +, X,,(t) be any set of m solutions of (5). Then a necessary and sufficient 
condition that X(t), i = 1,2,- > +, m, be linearly dependent over ty << t < ty 
is that the constant vectors X;(to), Xo(to), °° *» Xm(to) be linearly dependent. 


Proof. If there exists a nontrivial relation 


DVaX(=0, 4<t<k 


t=1 
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a fortiori the same holds for t = ft. Hence the necessity. To prove 
sufficiency, suppose there exists a nontrivial relation 


m 


2 c,X(t) = 0 (6) 
Consider the function 
X(t) = > ¢;X((t) (7) 


4=1 


X(t) is a solution of (5), and since it vanishes for t = f, by Theorem 1 it 
vanishes identically. Hence the system X(t), X,(t),-- -, X,,(t) is 
linearly dependent. 

There now follows immediately the basic theorem on linear systems. 


Theorem 3. No linearly independent set of solutions of (5) can contain 
more than n members. There exist linearly independent sets containing n 
members. 


Proof. If X,(t), Xo(t),° + * X,43(t) were linearly independent solutions, 
it would follow by Theorem 2, setting t = ft, t, < t% < f, that the (n + 1) 
constant vectors X;(to),i = 1,2,- + -,m + 1 would belinearly independent, 
which we know to be impossible. On the other hand, if we define n 
solutions X,(t) by 

Xt) = E; i=1,2,-+-n” (8) 


the E’s being the unit vectors, then these X’s are obviously linearly 
independent. 


Corollary. If X, X2,° ° *, X, is a set of n linearly independent solutions 
of (5), then any solution X(t) of (5) can be written in the form 


n 
X(t) = 2, c,X;(t) (9) 
i= 
Proof. By Theorem 3 there must exist a nontrivial linear relation 
among X and the X;,’s 


cX + > ¢,X,=0 (10) 
j=l 


If c = 0, the X;,’s are linearly dependent, contrary to hypothesis; hence 
we may divide by c and obtain the required relation. 

For this reason a set of m linearly independent solutions of (5) is known 
as a fundamental system or linear basis of (5). The most general solution 
of a linear system may therefore be written as a linear combination of the 
members of the fundamental system. It is clear that no set of k<n 
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solutions enjoys this property. A set of solutions of (5) forms a funda- 
mental system if and only if the determinant of their components, |z,,|, 
is different from 0. This determinant will either vanish identically in ¢ 
or vanish for no value of t in the interval t, << t < t. 


3. Solutions Expressed in Matrix Form 


We shall need to introduce a few concepts of matrix algebra. If 
and @ are two square matrices of order n 


BA =[a,) B= [by] 
their sum is defined to be the matrix 
A+ B= [a,, + b;5] 
Their product is the matrix @ representing the linear transformation 
Y = A(BX), i.e. 
C=AB=[cyl cy = ») Aydy 


k=1 


Such products are known to be associative. 
If the elements a,, of a matrix ~ are functions of a variable t, we may 
define new matrices by differentiation and integration 


df [da;; ¢ t 
a = 2] re dt = | | 20 a 
It is easy to verify that 


dl +B) _ dol dB dB) _ dB | def 
dt dt dt dt dt dt 


and similar formulas. ‘Finally we define the identity matrix &: 


10 ---. 0 
0 1 cee 0 
= . 8 re . 
00 ---. 1 


which has the property that for any vector X, X¥ = &X. 
Suppose now that <(r) is an n-matrix whose components a,, are 
continuous functions of t over some interval ty < t < %. 
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Then our homogeneous linear equation may be written 


“= A(t)X, X(t) = Xp; i<t<t (1) 
Set 
Bt) =[ dt (2) 
We consider the special case where ” and # are commutative; i.e., that 
AB=BA, iL<t<t, (3) 
This will in particular be true if is a constant matrix, for then 
B=(t—t)AS 


Under this assumption, we shall obtain the solution of (1) in explicit 
form by Picard’s approximation method. 
It follows from the commutativity of W and # that 
qa” = mad BP, m=1,2,:° > (4) 
dt 
Let the initial value Xp be the first approximation. Then the next approxi- 
mation, X(t), is defined by 


dX. 
; on =HX, X(t) = X% (5) 
1.€., by 
€ 
fo 
The mth approximating solution, X,,(t), will satisfy 
dX in 
ai = LX 1, Xm(to) = Xo (7) 
Using (4) and (6), it follows easily by induction that 
B gr 
Xt) =F + B+ +o +7 X, (8) 


But by the proof of Picard’s theorem (as generalized from the special case 
of Chapter 1) it follows that as m —> oo, X,,(t) converges uniformly in a 
sufficiently small neighborhood of f) to a solution X(t) of (1). 

If we set X) = E;, the ith unit vector, the right-hand side of (8) ediines 
to the ith column of the matrix 


G+ > — (9) 
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whence it appears that each of the terms of (9) approaches a limit uniformly 
in t as m-> oo. The matrix so defined we call for obvious symbolic 
reasons e*. Then the solution of (1) may be displayed in the form 


X(t) = e® X, = exp {[ a iG (10) 


In particular, the n columns of e* considered as vectors constitute a 
fundamental system of (1). 
Expressing the condition that (10) be a solution of (1), we have 


d(e*) 


ae X, = Le®X, (11) 
and substituting X, = &, (11) reduces to 
di B 
e -_ A e® (12) 


This is of course in agreement with our symbolic notation, since d@/dt = 
xf, Observe the analogy between (10) and the solution of the simple 
differential equation 


= = a(t)x, X(t) = py 
namely 
X = Xp EXP eo a (13) 


This is of course the special case of (10) when n = 1. 

It should be remarked that the series 

6+ — (14) 
m=1 m! 

defining e® not only converges in the restricted interval of Picard’s theorem, 
but converges uniformly over any closed interval in which each of the 
elements of V(t) is continuous. In fact, if A is the length of the interval, 
and if in the interval each of the elements of «(t) is bounded by some 
constant M, the reader will prove without difficulty that each of the 
elements of B” is bounded by n™—"(Mh)™. Hence it will follow that over 
the given interval (14) converges uniformly. Since the differentiated 
series has the same property, formula (12) is still valid. Hence (10) is 
easily seen to give the solution of (1) over the entire interval in which (ft) 
is continuous, i.e., in which the solution of (1) exists. 

It should be emphasized that this section is based on the assumption 
that (3) holds. 
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4. Reduction of Order of a System 


Suppose k <n linearly independent solutions of the homogeneous 
system 


n 
a Date, i=1,20+ +n (1) 
dt j=l 
have somehow been obtained. We shall show that (1) may be “‘reduced 
to a system of order n — k” in the following sense; if a fundamental 
system of solutions of a certain system of order n —k is known, a 
fundamental system of (1) may be determined by quadratures alone. 


Theorem 4. Suppose there has been found one nontrivial solution X(t) 
of (1) 

X(t) = {m(t), ne(t),* + +s M(t} (2) 

Assume one of its components (say without loss of generality y,) vanishes 


nowhere on ty <t <t.* Then (1) may be reduced to a system of order 
n—1. 


Proof. Define new variables y,, i = 1, 2,- + -,n, by 
n{t) —°  , 1 

_ ns i=1,2,:°-°,n—1, =— 27, (3 
na m= © 


This is possible since 7,(¢) vanishes nowhere, and the transformation is 
nonsingular since we can write 


u=Yy,+ NMt)Y ns i=1,2,---,n—1; Ly, = NAA) n (4) 


We wish to express the y’s as solutions of a homogeneous system. 
Differentiating (3) we obtain 


dy; _ dt, (2) — es 


¥, = 4; 


= — t,—\|— — i=1,2,---+n—1 


dt dt “dt\n,/ 1, at’ 

dy, 1dx,  d ( 1 

‘dt 1", dt ta Nn 4m (5) 
Substituting in (5) first for dx,/dt from (1) and then for x, from (4), the 
y’s are seen to satisfy a system of the form 


= 2 bislt)Ys, P= 12.8 65 n (6) 
j= 


where the b’s may be explicitly calculated; they are certainly continuous 
over 1; <<t<t,. It is easy to see that (1) and (6) are equivalent; i.e., 


* This will in any case be true over some sufficiently small interval. 
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that to every set of x’s satisfying (1) there is defined by (3) a set of y’s 
satisfying (6), and vice versa. The solution of (6) corresponding to (2) is 


0, ixn 
y= : (7) 
if i=n 
Substituting (7) in (6) we obtain, without making a long calculation, the 
relation 
b,,(t) = 9, i=1,2,---n 


Therefore in particular the first (7 — 1) equations of (6) do not contain 
y,, at all; hence they may be treated as a separate linear system of (n — 1) 
equations in the (n — 1) unknowns 4, Yo,° * *,Y,-1. Suppose a funda- 
mental system 


Zi = Yar Yias* * *s Yin-ws i=1,2,°°-+;2-1 (8) 


of this set of equations is known. Then functions y,,, i= 1, 2,° °°, 
n — 1, may be defined by quadratures from 


AY in _ met 
oa > badt)yidt) (9) 


If we adjoin the functions y,, as nth components to the vectors Z;, we 
construct a set of (m — 1) n-dimensional vectors Y,, Yo,° °°, Yp-1. 
They are by construction solutions of (6), and a fortiori linearly in- 
dependent. Let us now adjoin to this set the vector Y,, = (0, 0,- - -, 1). 
Then ¥,, Y,,° - °, Y,, are all solutions of (6). Suppose we could write 


> ¢¥,=0 


°. 
{| 
- 


The first (x — 1) c’s must be 0, since the Z’s are linearly independent. 

Since Y,, ~ 0, it follows that c, = 0, and the Y’s are linearly independent. 

If finally we define from the Y’s a system of n vectors X;, X,° °°; Xn 

by (4), these will all be solutions of (1), and the proof is complete. 
Suppose now there is given another solution 


X(t) _ (4 fe, eee Sy cn) (10) 


of (1) linearly independent of (7,, 72,° - *,,). Then a particular solution 
of the first (x — 1) equations of (6) is 

Z= (6,, 6, ey: 6,,-) 
where 


0, = be — Tb Po to eee en ee (11) 


n 
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If 6, = 0, solutions (2) and (10) are linearly dependent; hence on some 
interval one of the @’s never vanishes. Then we may apply our method 
again on (6) using the particular solution (11) and reduce the order of 
(1) to n — 2. 

Proceeding in this way, it is easy to prove that, given k < n linearly 
independent solutions of (1), there is some interval in which (1) may be 
reduced to a system of order n —k. In fact, any closed interval of 
t; <t< ft, may be covered by a finite set of such intervals, and the 
fundamental systems defined in each covering interval may be ‘“‘matched” 
so as to define a fundamental system over the whole closed interval. 
Thus the problem of reduction of order is substantially solved. 


5. Nonhomogeneous Systems 


Consider now the nonhomogeneous linear system 


Bios aie Pb). Laas (1) 
dt j=l 


or in vector notation 
— = AX+B, B= (bh, be, * : *, b,) (1’) 


Let a,,(t), b(t) be continuous on t,<t<4t,. Suppose X(t) is any 
particular solution of (1). Then if X,(¢) is any other solution of (1), 
obviously X(t) = X,(t) — X,(t) is a solution of the homogeneous 
equation 


ax, = 

ean At 

dt p> al), 
ie., of 

dX 


Theorem 5. The most general solution of the nonhomogeneous equation 
(1) is obtained by adding to any particular solution of (1) the general solution 
of the homogeneous equation (2). 


Therefore if the homogeneous equation has been solved, all we need 
do to find the general solution of (1) is find one particular solution of (1). 


Theorem 6. A particular solution of the nonhomogeneous system (1) 
may be obtained from a fundamental system of (2) by means of quadratures 
alone. 
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Proof. Let X;, X2,° + +, X, be a fundamental system of (2). We use 
the so-called “method of variation of constants.” Suppose a particular 
solution of (1) can be written in the form 


X= > ¢,X, (3) 
t=1 


where the c’s are not constants as in the homogeneous case, but unknown 
functions of t. We shall try to satisfy (1) by a suitable choice of the 
functions c,. Differentiating (3) and substituting in (1) we have 


So, + YX, = Yew X, + B (4) 
is. ty dS 
or since the X;,’s satisfy (2) 
n dc, 
X,—=8 5 
p> iF (5) 
If we now set 
X, = (jy, Vien" °°, Vin) (6) 
(5) may be written in the form 
Say —! = by i=1,2,---,n (7) 
j=l dt 


Since |z,;| is nowhere 0 (end of § 2), (7) may everywhere be solved for 
dc,/dt. Thus 

dc; 

Sp = alt 8) 


where p,(t) are continuous functions that may be easily calculated. 
Equation (8) may be solved by quadratures; and any solution ¢, c,.° * *, 
c, Of (8) will define a particular solution of our nonhomogeneous system 
(1) by means of (4). This completes the proof. 


PART B. 
LINEAR EQUATIONS OF HIGHER ORDER 


6. Fundamental Systems 


We consider now the linear differential equation of order n in one 
unknown 
d"x ae 
ae + Pit) tt + pe + qt) = 0 (1) 
t dt 
where the functions p,t) and g(t) are continuous over some interval 
t,<t< ty. We are interested only in normalized equations where the 
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coefficient of d"x/dt"is 1. If d"x/dt" has a coefficient po(t) which vanishes 
nowhere, the equation may be normalized by dividing by pp. 
We consider at first the homogeneous case in which q(t) = 0. We shall 


generally use the notation 
d"x 


n d™I1z 
L(x) = apt 2 PM) 7a = 0 


(2) 


Following the method of Chapter 2, § 6, we may replace (2) by the 


linear system 


dx 
ae 
dx, 
dt? 
Xp 
da 
dx, 
dt = = PyX_ — Pony — °° * = Pn 
or in vector form 
aX 
= SAX 
where the matrix .% is 
0 1 0 0 
0 0 1 0 
A(t) = 
0 0 0 0 
—Pn —Pn-1 —~Pn-2 —Pe2 


(3) 


(3’) 


(4) 
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Then as we have seen in Chapter 2, § 6, (2) and (3) are equivalent. For 
if x = ¢(t) is any solution of (2), then 


X() = 4,9," > gl") (5) 
is a solution of (3’); and conversely, if X(t) is a solution of (3’), its first 
component is a solution of (2). It follows from the remarks in Chapter 
2, § 6, and the results of Part A of this chapter, that the values of 2 and 
its first (2 — 1) derivatives at some point ft) (where t, < t < ¢,) define a 
unique solution of (2) over the whole open interval 4, <t< 4. If at 
any point ¢ = %), x and its first (» — 1) derivatives all vanish, x(t) is 
identically 0. The solutions x(t) of (2) form a linear manifold. 

It is easy to see that the correspondence we have set up between the 
solutions of (2) and (3) preserves linear independence. For if $,(t), $2(t), 
- + +, ¢,,(t) is a linearly independent set of solutions of (2), a fortiori the 
m vectors 

X(t) aa (Cs :'» nF ot pi"), i= l, zy “sty m (6) 
are linearly independent. On the other hand, if there exists a nontrivial 


linear relation 
m 


2d =0 
differentiating we obtain 
Sed.LNO == 0, j=l,2,-+ on (7) 
iz 
and the equations (7) are simply the components of 
Sax 7 =0 (8) 


Therefore the X’s also are linearly dependent. This means that the ¢’s 
are linearly independent if and only if the X’s are. 

Hence as in § 2 it follows that there can exist no set of (” + 1) linearly 
independent solutions of (2), but that there exist sets of n linearly indepen- 
dent solutions of (2). Any such set will be a fundamental system of (2). 
Take a point fg, ty < ty < te. If$,(t), $2(¢),- + +, },(t) are any n solutions 
of (2), they constitute a fundamental system if and only if the determinant 
of the components of the X’s taken at t = f fails to vanish 


1 de oy Pn 
di he As Dy 
Wb b° * Gn) =]: : cae #0 (9) 


pl" pl" ear $i" 
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This determinant is known as the Wronsky determinant or Wronskian of 
the functions ¢,, ¢o,° - °,¢,- We may then state our results thus: 


Theorem 7. The homogeneous equation (2) always has a fundamental 
system of precisely n solutions. A set of n solutions ¢,(t), $.(t),* > +, $,(t) 
of (2) constitutes a fundamental system if and only if its Wronskian W(q,, 
do," * *s $,) fails to vanish for some point to, ty < ty < tg. The Wronskian 
will either be different from 0 everywhere on this interval or vanish identically. 


7. The Wronsky Determinant 


We shall now consider the significance of the Wronksian of any n 
arbitrary functions (not necessarily solutions of some differential equation). 
It will appear that the vanishing of the Wronskian is related to their linear 
dependence; but the criterion is not as simple as in the special case of 
Theorem 7. 

Theorem 8. Let ¢,(t), ¢2(t),° °°, ?,(t) be any n functions continuous 
together with their first (n — 1) derivatives over some interval t, << t < tg. 

(a) If the $’s are linearly dependent over ty << t < te, then 

Wb, 2° * *s bn) = 9, h<t<h 

(b) Suppose 

(1) Wh, bo ne) bn) = 0, hy <t< he; but 

(2) for some (n — 1) of the ¢’s (say without loss of generality, all but },) 


Wr b2°* bra #O0,allt, h<t<h (1) 
then $1, $o,° * *,, are linearly dependent over ty << t < ty, 
Proof. 
(a) If there exists a nontrivial relation 
2 ch) =0 
= 


Differentiating we obtain 


n 


Yes PMHS0, fHl%- sn 2) 


i=1 
Equation (2) implies that for t = f), each fo, the determinant of the c’s 
will vanish. This determinant is precisely W(¢,, $2, ° - *, ,)- 


(b) The equation 
Wr $2° * > bn» *) =0 (3) 


is a linear homogeneous differential equation of order (mn — 1) in x with 
continuous coefficients. The coefficient of (d""x)/dt"") is W@4,, 
go, * * ‘> Sy—y), and since by (1) this never vanishes, (3) may be normalized. 
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Now 4), $0, ° * *, $,-; are all solutions of (3) since they make two columns 
of the determinant equal. x = ¢, is a solution by hypothesis. Since (3) 
is only of order (n — 1), its n solutions 4,, ¢o,° - -,¢, must be linearly 
dependent. 

Part (b) of Theorem 8 may be proved directly without the use of the 
theory of differential equations, but the proof will be longer. The second 
part of the hypothesis in (6) cannot be omitted, but we may prove the 
corollary: 


Corollary. Let ¢$,(t), $o(t),- + °¢,(t) be any n functions continuous 
together with their first (n — 1) derivatives over some interval ty <t < tg. 
Let 

Wor, bo, ee, bn) = 0, h<t<h (4) 


Then there exists a subinterval over which $, $0,° * *,¢, are linearly 
dependent. 


Proof. We give a proof by induction. The theorem is trivial for 
n= 1. Suppose it true form = m-— 1. Then consider the m functions 
d1, bo,° * ‘5bm- If the Wronskian W(¢,, $2,° * *m—1) fails to vanish 
for some point fo, t) < ty < fj, there is some interval of f) in which it does 
not vanish, and Theorem 8 applies. Otherwise the smaller Wronskian is 
identically 0, and by our induction ¢,, $9, ° + *, my are linearly dependent 
over some subinterval of t; << t<t,, This completes the proof. 


8. Further Properties of the Fundamental System 


We have seen that to any normalized linear homogeneous equation of 
nth order there corresponds a fundamental system of 7 solutions whose 
Wronskian never vanishes. We can prove the following converse. 


Theorem 9. Let ¢,(t), $o(t),- * *,¢,(t) be any set of functions con- 
tinuous together with their first n derivatives over some interval ty < t < to; 
and let their Wronskian vanish nowhere. Then there exists a unique 
normalized equation of nth order 
possessing $1, $o,° * *, $y, a5 a fundamental system. 


Proof. The equation 
Wh, os “fy Pn» L) =i 
Why, Pes Pes Pn) 


is a homogeneous linear equation of nth order in x with continuous 
coefficients. The coefficient of (d"x)/(dt") is 1. Equation (2) has the n 


L(a) = (2) 
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solutions ¢,, ¢,,° - -,¢,; and by Theorem 8 the ¢’s are linearly indepen- 
dent, since their Wronskian does not vanish. Therefore (2) satisfies all 
the conditions and the existence is proved. 

Suppose now there exist two such equations 


L,(x) = 0, L(x) = 0 
Consider the equation 
M(x) = L,(x) — L,(x) = 0 (3) 


It is a homogeneous linear equation with continuous coefficients. If any 
one of its coefficients is anywhere different from 0, there exists some 
interval over which it can be normalized. But its order is at most (n — 1), 
and it possesses the 7 solutions ¢,, ¢.,° : :,¢, Which are linearly in- 
dependent over any interval (their Wronskian being nowhere 0). There- 
fore M(x) cannot be normalized; i.e., it vanishes identically. Hence the 
equation (1) is unique. 


Theorem 10. Let ¢,, $9,° * *,$, be any fundamental system of 
dz a's 
ey adie 7 pseneelee 4 
L(x) aa + 2, PAt) qa 0, hare (4) 
Set W(t) = W(4,, bo,° + *,¢,). Then if ty is any number, t, < ty < ty 
t 
W(t) = W(to) exp —- i Pilt) a (5) 


Thus the value of the Wronskian is essentially independent of the 
particular fundamental system we choose. This result explains the fact 
that in Theorem 7 we have only to assume that W(t) fails to vanish at one 
point. 


Proof. By virtue of the uniqueness of L(x) in Theorem 9 we may write 
Why ba * * > $n» v) (6) 
Wh, 2» ane $n) 


The identity in (6) implies that the coefficients of each term on either side 
are identical. In particular, equating the coefficients of (d"“x)/(dt"), 


L(x) = 


we have 
$y dy Pn 
1 pe, bn 
AO = — Ho] . bites J ™ 
g,l-21 gf] ..- g,lr-2 


¢, i") gat"! -o se inl 
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Now the determinant in (7) is exactly [dW(t)]/dt; for by the well-known 
formula for the derivative of a determinant 


oy be ee ane 
oy a mie Dy 

Wit=| | ee + (n — 2) terms 
4,001 4,1 4,01 


¢ de st $y 


He ae. aes i) 
p21 gl]... pina} 
dil fh) ww. hin) 


where the first (n — 1) terms vanish identically, having two equal rows. 
Therefore 


LOS = (9) 


and integrating (9) we obtain (5). This completes the proof. 


9. Reduction of Order 


We recapitulate briefly the theory of § 4 in the case of a single equation 
of higher order. 


Theorem 11. Let n(t) be a nontrivial solution of 


dz & d"—*x 
L(x) = — () —— 1 
a) = Te + & Pl) a () 
Then the transformation 
o= ny (2) 


reduces (1) to an equation of order (n — 1) in z =’ over any interval in 
which n vanishes nowhere. 


Proof. Differentiating (2) and substituting in (1), we obtain 
L(ny) = yl"In + lower order terms 


Dividing by 7 we see that y satisfies an equation of the form 


yl + > g(hy‘l = 0 | (3) 
t=1 
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where the g’s are continuous. Equations (1) and (3) are equivalent, and 
since y = 1 is a solution of (3), g, =0. Hence if we set z=y’, we 
obtain an equation of order (m — 1) in z 


gin—1] + qyzlt—] + ore ws + In-1% = 0 : (4) 


Suppose a fundamental system of (4) has been obtained. We may then 
define (n — 1) solutions of (3) from this fundamental system, and adjoin 
to these the solution y= 1. The proof that the m solutions of (3) so 
defined are linearly independent follows easily from z = y’; the reader 
can verify the fact directly. By (2) we may then define 7 solutions 
of (1), which will be linearly independent [(2) being nonsingular] and 
hence constitute a fundamental system of (1). This completes the proof. 

As before, it may easily be shown that, given k < n linearly independent 
solutions of (1), (1) may be reduced to a system of order (n — k). 


10. The Nonhomogeneous Case 


We consider finally the nonhomogeneous equation of nth order 
La@+qt)=0 t<t<t (1) 
d"x . & 
L@)=—, + >> pdtjat4 


Equation (1) is equivalent to the system 


(2) 


Pes — Prt —* *° — P\X, — Gt) 


Exactly as in Theorem 5, we prove that the most general solution of (1) 
may be obtained by adding to a particular solution of (1) the general 
solution of L(x) = 0. Referring to Theorem 6 and making the appro- 
priate substitutions, we can apply the method of “‘variation of constants” 
to obtain a particular solution of (1). 
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Theorem 12. Let $4, $o,° * *,¢, be a fundamental system of L(x) = 0. 
Then a particular solution y(t) of the nonhomogeneous equation is given by 


y(t) = > etypt) (3) 


t=1 
where the c’s satisfy the system 


>of) =0, j=1,2,-++4n-1 


4i=1 


S cP = —g) 


t=1 


(4) 


The proof follows directly from Theorem 6; however, the following 
direct verification is interesting. The determinant of the coefficients of 
c,’ in (4) is precisely W(¢,, $,° * + .%,). Since by hypothesis this vanishes 
nowhere, (4) may everywhere be solved, and the c’s determined by quad- 
ratures. From (3) and (4) it appears immediately that 


n 


pr) = > c(h), k=0,1,--42-1 
~ 6) 


prt) = 2 (toi) — q(t) 
Setting x = y in (1) and using (5), it follows that 


Ly) = ¥ aL) — a0 
Since L(¢,) = 0, we obtain 
L(y) = — q(t), Wolt<ch 


which completes the proof. 


11. Green’s Function 


We shall consider in this section a method whereby the solution of the 
nonhomogeneous linear equation 


L(x) + q(t) = 0 (1) 
where 


d"x a : 
Lea) = > + > pairs 


may be obtained from a single function associated with L(x) known as 
Green’s function for L(x). It will be more convenient for this problem 
to assume that p,(t) and q(t) are continuous over some closed interval 
axt<b. 
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A solution 2(t) of L(x) = 0 satisfying the initial conditions 
x(a) = 0, x'(a) = 0,- : -, l(a) = 0 
must vanish identically, provided we assume (as we have up to now) that 
x(t) and its first (7 — 1) derivatives are continuous over [a,b]. Let us 
drop this last assumption and consider a solution x(t) whose (n — 1)st 
derivative z["—(t) has a discontinuity at some point s, a<s<b. 
Precisely we consider the function given by the following definition: 


Definition 1. Let x(t) be a function defined over a< t < b such that 
(1) x(t) and its first n derivatives exist and satisfy L(x) = 0 for every 
point ofa<t<b except possibly the single point t = s. 


(2) x(a) = 0, 2’(a) = 0,- + +, al" Na) = 0. (2) 
(3) The functions x(t), x'(t),- + +, xl"-*)(t) are continuous at s (hence 

everywhere ona<t<b). xl"—(¢t) has a discontinuity at t = s 
gl"-1ll(s — 0) = 0, al™-"s + 0) = — 1 (3) 


Then x(t) is called a Green’s function or weighting function for L(x) and 
is denoted by K(s, t). 

Let L, denote differentiation with respect to ¢ by the operator L. 
It is clear that K(s, t) exists and is unique; for overa<t<s, K(s,t) = 
0; and over s<t<)b, K and its first n derivatives are continuous and 
satisfy L,(K) = 0, whence K is uniquely determined by the boundary 
conditions (2) and (3). 

Lemma 1. K(s, t) and its first (n — 2) derivatives with respect to t are 
continuous in s and t simultaneously, fora<s<b,ax<t<b. The same 
is true for K"—\s, t) and K,!"\(s, t), except that these functions may be 
discontinuous where s = t. 


Proof. For s > t, K(s, t) = 0 and the lemma is trivial. For s < t, set 
afs,th= KU Us,1), f=1,2,-++,0 


Then by a change of variable the x’s are seen to satisfy the system 


dx 
i 
(4) 
AL n—1 —_ 
ger 
dx 
= —p,(t+s—a)t,—-+ ++ —p(t+s—a)z, 


dt 
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fora<t<b + a-— 5, with the initial conditions 


x(a) = 0, i=1,2,---,n—1; x,(a) = — 1 
Set 
X(s, t) = {x(s, 1),° + +, ZA(s, t)} 


Let s, < s_ be any two values of s, and set s = 5, in (4). Then X(sq, t) 
satisfies (4) with a certain error e; and since the p’s are continuous and 
the x’s all bounded, it is easy to see that lim « = 0, uniformly in#. Hence 
8-8; 
by the fundamental convergence theorem, lim X(sq, t) = X(s,, t), uni- 
88 
formly in t. That is to say, X(s, t) is continuous in s and ¢ simultaneously, 
s<t. Since K(s,¢t) and its first (2 — 2) derivatives are continuous 
also at s = t, they are continuous everywhere ina<s<b,a<t<b, 
and Lemma | follows. 
We now can prove our main result; 


Theorem 13. The solution x(t) of the nonhomogeneous equation (1) with 


initial conditions 
alia) =0, i=1,2,-- +n (5) 


is given by 
b 
a(t) = { K(s, t)q(s) ds (6) 


Proof. By Lemma | the integral (6) exists. Since the derivatives of 
K(s, t) with respect to ¢ up to the (n — 2)nd order are continuous in s, we 
may differentiate under the integral sign (n — 1) times with respect to ¢, 
obtaining 


b 
x U-U¢r) =[ Ks, t)q(s) ds, J=l,2,°++,0 (1) 


The fact that K,"-1\(s, t) has a single discontinuity at s = ¢ does not of 
course impair the validity of the last step. Since K,"-N(s, t) =0, 5 > 1, 
we may also write 


gilt) = [ "K-35, t)q(s) ds (8) 


Since the integrand is now continuous over the whole range of integration, 
we may differentiate again by the usual rule, obtaining 


ainl(t) = [ KEN, t)q(c) ds + Kis, t)q(s)],-1-0 () 
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Since 
Kis, t)},-,-9 = KOU, Oh_w9 = —1 


by (3), we may write (9) in the form 
200) = [KON, Nao) ds — a0 (10) 
From (7) and (10) we obtain 
La) = [ LAK) ds — qi) = G0), a<t<b aD 


Hence x(t) everywhere satisfies the nonhomogeneous equation (1). 
From (7) the boundary conditions are obviously satisfied, and ~ is the 
required function. 

We can give a physical interpretation of Green’s function in the follow- 
ing way. Suppose (1) represents the motion of a dynamical system, and 
q(t) an external force applied to it at time ¢t. Suppose we let g(t) be a 
unit impulse; i.e., let g(t) be 0 except for a small interval tf) -d<t< 


b | 
ty + 6, but et | q(t) dt = 1 (Fig. 1). Then by (6), x(t) is practically the 


a 
same as K(fo, ¢); i.e., K(s, t) represents the response of the system at time ~ 
t to a unit impulse applied at time s. This makes it obvious why K = 0, 
t<s. 


a(t) 


Fig. 1 


If now we conceive of any force function g(t) as being made up of a sum 
of such elementary impulses, the integral (6) says that the response of the 
system at time ¢ to the force function is obtained by adding its responses 
to all the elementary impulses at time s, for all s < t. 
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PART C. 
LINEAR SYSTEMS WITH CONSTANT COEFFICIENTS 


12. Introduction. Complex Solutions 


We consider the linear homogeneous system whose coefficients are 
real constants 


Se Sag: Welles 
t j=1 
i.e. (1) 
dX 
a = SX 


This is a special case of the general linear system of Part A. Hence the 
solutions of (1) are defined for all real values of ¢, and it is easy to verify 
that they possess derivatives of all orders. 

The essential difference betwecn (1) and the general linear system lies 
in the fact that if a vector X(t) is a solution of (1), then its first derivative 
X'(t) is also a solution. This follows immediately from the relation ~ 


d dX 
1 CO=H4s (2) 


sf being constant. Hence the derivatives of any order, or any linear 
combination of derivatives, are also solutions. We shall always make use 
of the condition that ./ is a constant matrix in this way. 

It will be convenient in what follows to consider not only real but also 
complex solutions X(t) of (1). (The variable ¢ will always be considered 
real.) If X = (a, 2,° - *, %,) is any complex vector, let us define the 
real and imaginary parts of X as 


R(X) = {R(%), R(%),* * +, R(Xq)} 
W(X) = {H(a6 1&2), * + +, 1(e,)} 
We have immediately Lemma 1: 


Lemma 1. X(t) is a solution of (1) if and only if both R(X) and I(X) 
are solutions of (1). 


(3) 


We define linear independence of complex vectors and functions exactly 
as in the real case, except that complex constants c; are allowed. It is 
clear that if a set of real vectors (or functions) is linearly independent in 
our previous sense it remains so if complex constants are allowed. All the 
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statements of Part A hold without change when complex solutions of the 
system are admitted. 


Theorem 14. From a set of n linearly independent complex solutions 
X(t), X2(t),° + +, Xp(t) of (1), there may be constructed a set of n linearly 
independent real solutions of (1), which therefore constitute a fundamental 
system. 


Proof. Let ty be any value of t, and consider the set of constant vectors 
X(t), i= 1,2,: + +,n. By Theorem 2 this set is linearly independent. 
Hence among the (27) real vectors 


R{Xf%)}, AX}, f= 1,2,¢ ++, 0 (4) 


precisely n are linearly independent. For if all the vectors (4) could be 
expressed in terms of some (7 — 1) vectors, so could X,(t%), i= 1, 
2,°° *,m. Hence among the functions 


RX}, HX}, i=1,2,-0 57 (5) 


precisely n are linearly independent; and since by Lemma 1 they are 
solutions of (1) the proof is complete. 

We are therefore entitled to say that (1) has been completely solved 
when a set of m linearly independent complex solutions is known. We 
shall show in the following sections that such a set may be obtained by 
purely algebraic means. 


13. Characteristic Values and Vectors of a Matrix 


In this section and the next we shall introduce the algebraic ideas 
necessary for the present discussion. Let .& = [a,,] be ann x mn matrix 
of real constants. A characteristic vector of & is defined to be any real 
or complex vector X different from 0 such that Y = &X is a multiple of X 


AX =AX, or(W —1é)X =0 (1) 


where A is some real or complex constant. A is called a characteristic 
value of the matrix .<, and X a characteristic vector belonging to 2. In 
terms of components (1) becomes 


(4, — Ajay + Agta tees + AyyX, = O 
Gy %, + (Ag — Alte + ++ + Qg,%, = 0 (2) 


Any % + Ongle t+ ++ + (Gy, — Ax, = 0 
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The system (2) will admit a nontrivial solution (a, %,° + °, 2,) only 
for those values of 4 for which the determinant of the coefficients vanishes 


a, —A Ae aoa An 
aqy Beg —A ++: Gen 
weiwi-| 0 5 tt ft feo @ 
any ane 8 Ann = A 


A number A will be a characteristic value of if and only if it satisfies 
the equation (3), which is therefore known as the characteristic equation 
for %. Since (3) is an algebraic equation of degree n, it follows that 
cannot have more than n different characteristic values. If A is an m-fold 
root of (3), it is said to be a characteristic value of multiplicity m. From 
the fundamental theorem of algebra it follows that ./ has precisely n 
characteristic values, each different one being counted as many times as 
its multiplicity. 


Theorem 15. Let X;, Xo,° °°, Xp be m characteristic vectors of A 
belonging respectively to the distinct characteristic values A, Ag,° * *, Am: 
Then the X’s are linearly independent. 


Proof. The theorem is trivial for m = 1, since a characteristic vector 
cannot be 0. We give a proof by induction. Suppose the theorem true 
for m = k — 1, and let X,, X2,- - °, X, be characteristic vectors of 
belonging respectively to distinct characteristic values A,, A.,° °°, Ay. 
Suppose there exists a nontrivial linear relation 


>) c,X,=0 (4) 


k 
2 cAiX, ;=0 (5) 


We may assume without loss of generality that at least one of the 
constants cy, Co,° * *, Cy_y is different from 0. Multiply (4) by 4, and 
subtract (5) to eliminate c,A,X,. This gives 


k-1 
> c(A, — 4)X, = 0 (6) 


i=1 
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Since 4, — A, #0, (6) is a nontrivial relationship contrary to the in- 
duction. Hence the relation (4) cannot be nontrivial. This completes 
the induction. 

In particular, suppose that there exist n distinct characteristic values 


Ay, 4q,° * *, An (1.€., all the roots of the characteristic equation are simple). 
Then the system (2) admits of essentially only a single solution X, = (vj, 
Lio,’ * *» Lm) for each A; and the n vectors X,, X5,°°°, X,, so defined will 


be linearly independent. 


14. Vectors Associated with Characteristic Values of a Matrix 


We have just seen that if the characteristic equation has only simple 
roots, then there exists a set of n linearly independent characteristic 
vectors of x. To handle the case where the characteristic equation has 
multiple roots, we introduce the following definition: 


Definition 2. Let A be a root of the characteristic equation |~ — 1€| = 0. 
Then a complex vector X different from 0 is said to be a vector associated 
with A of multiplicity m, if for some integer m (and no smaller m) 


(sf —1€)"X¥ =0 (1) 


If m = 1, X is a characteristic vector belonging to A. It can be shown 
that no vector can be associated with two different characteristic values. 

We state without proof the following theorem of algebra; it follows 
from the general theory of elementary divisors, which is treated in any 
text on the algebra of matrices. 


Theorem 16. Let Ay, Ag,° * *, A, be the distinct roots of |S — 1é| = 0, 
k 


with respective multiplicities m,, m,° * *, M, where > m,=n. Then 
i=l 
corresponding to each i, there exists a set of precisely m, vectors X,; 


(j = 1, 2,+ - +, m,) such that 
(1) X;; (GG = 1, 2,- + +, m,) is associated with 4, of multiplicity < m,. 


(2) The set of n vectors X,, (j= 1, 2,° + +,m,; i= 1, 2,° + +, k) is 
linearly independent. 


In particular, if all the multiplicities are 1 (k =n), this reduces to 
Theorem 15. 

This completes the algebraic preliminaries. We shall now consider the 
problem of finding a fundamental system of solutions of dX/dt = WX. 
Our general result will be this: if all the characteristic values of are 
known, a fundamental system of solutions can be found by a finite number 
of elementary operations (i.e., solution of linear algebraic systems). 
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15. The Solution in the Simplest Case 
In this section we shall show how a fundamental system of 
— =X (1) 


may be constructed, on the assumption that the roots of the characteristic 
equation are all simple. In this special case we shall not need to assume 
Theorem 16 in its general form. 

Our first two theorems (17 and 18) are independent of the nature of the 
roots of the characteristic equation. 


Theorem 17. Let X(t) be a solution of (1) for all t. If, for some value 
ty of t, X is a characteristic vector of S belonging to the characteristic 
value A 


A X(t) = AX (ty) (2) 
then the same is true for every value of t 
A X(t) = AX(t) (3) 
Proof. Consider the function : 
¥) = 29 _ axw (4) 
dt 

It is a solution of (1). We may also write 

Y(t) = MX(t) — AX(t) = (WH — IE)X(t) (5) 


But since 
Y(t) = (af — AE) X (to) =0 


by hypothesis, it follows from the uniqueness theorem that Y(t)=0. In 
view of (5) the theorem follows. 

It therefore makes sense to speak of a “characteristic solution” X(t) 
of (1), i-e., a solution for which (3) holds for all t. Theorem 17 then 
becomes: a solution X(t) of (1) is a characteristic solution belonging to 
A if and only if, for some value f) of t, X(t)) is a characteristic vector 
belonging to A. 

The importance of the characteristic solutions is that they may be ex- 
pressed in a particularly simple form: 


Theorem 18. Corresponding to every characteristic value 4 of &, there 
exists at least one characteristic solution X(t) of (1), which may be written 
in the form 

X(t) = (wye", xe", > - +, 2,€%) (6) 


where (x1, X2,* * *, %q) is a characteristic vector belonging to A. 
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Proof. Let A and one of its characteristic vectors X, = (1, %,° * *, %) 
be given. Let X(t) = {x,(t), x,(t),- - :, x,(t)} be the solution of (1) such 
that X(0) = Xo. Then by Theorem 17, X(t) satisfies 


dX(t) 


—— = DX = 1X 
= C) (7) 
Writing (7) in terms of its components, we have 
| ee = Azt), i=1,2,+ +--+,” (8) 
Hence 
x(t) = c,e** 
and since 


x(0) = c; = &, 


X(t) is precisely of the form (6). 
From this our first main result follows immediately: 


Theorem 19. Any set of characteristic solutions of (1) belonging to 
different characteristic values is linearly independent. If in particular & 
has n different characteristic values, the corresponding set of characteristic 
solutions forms a fundamental system of (1). 


Proof. Taking for ¢ any particular value ft), the constant vectors so 
obtained are linearly independent by Theorem 15. 

Observe that we might equally well attempt to find a solution of (1) in 
the form 


X(t) = (cye**, eye", + + +, Cyl“) (9) 


Substituting (9) into (1) the exponentials cancel, and the condition that 
(9) be a solution is that the c’s satisfy a certain linear algebraic system. 
This will have a nontrivial solution if and only if A is a characteristic value. 
This is essentially the method we shall use in the general case. 

Observe that the fundamental system we have obtained will in general 
consist of complex solutions, the /’s being in general complex. By 
Theorem 14 this may be reduced to a real fundamental system. 


16. The Solution in the General Case 


We shall show in this section that there exist what we shall call primitive 
solutions of 


—=WAX (1) 
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corresponding to the vectors of » associated with the characteristic 
values A. We shall show that these solutions may be expressed in a certain 
simple form; and from Theorem 16 it will follow that they constitute a 
fundamental system of (1). 

It will be convenient. to introduce the differential operator D = d/dt, 
and to write symbolic polynomials in D according to the scheme 


D*x means zt 
dt* 
d? d 
(aD? + bD + c)x means a 4 +b = + cx (2) 


(D — a)(D — b)x means [D? — (a + 5)D + abjx 
If X(t) is a solution of (1), we know (§ 12) that DX is also a solution and 


DX = AX 
from which there follows immediately 
DEX = A*X 


with k any integer. If therefore P is any polynomial with constant 
coefficients, we have the relation 


P(D)X = P(of)X (3) 
for any solution X(t) of (1). 

Theorem 20. Let X(t) be a solution of (1); and suppose that for some 
value ty of t, X(to) is associated with a characteristic value 4 of & with 
multiplicity m 

(Sf — IE)" X(t) = 0 (4) 
Then the same relation is true for all values of t 
(SY — 1)" X(t) = 0 
and also (5) 
(D — Ay" X(t) = 0 

Proof. It follows from (3) that 

(WS — AE)" X(t) = (D — A™X(t) (6) 


Let us now set 
Z(t) = (D — Ay" X(t) 


Then Z(t) is a solution of (1), being a linear combination of X(t) and its 
derivatives. But since 


Z(t.) = (of — I€Y"X(tp) = 0 | (7) 
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by hypothesis, it follows from the uniqueness theorem that Z(t) = 0. 
This completes the proof. 

Let us now define a primitive solution of (1) associated with the character- 
istic value 2 of multiplicity m as any nontrivial solution X(t) of (1) for 
which there exists a characteristic value A of and an integer m such that 
(5) holds identically. Theorem 20 then becomes: X(t) is a primitive 
solution of (1) if and only if there is some value fy of ¢t for which X(ty) is 
a vector associated with the characteristic value A of multiplicity m. 
The primitive solutions of (1) may, like the characteristic solutions, be 
written in a simple form: 


Theorem 21. If X(t) is a primitive solution of (1) associated with the 
characteristic value A of multiplicity m 


(Sf — AP)" X(t) = 0 (8) 
then X(t) is of the form 


X(t) = {P,(te, Pte", ne P,(t)e*} (9) 


where P(t) (i = 1, 2,° + +, ) are polynomials in t of degree <m— 1. 
Proof. By Theorem 20 we have 


(D — A" X(t) = 0 (10) 
Writing (10) in scalar form, where 
X(t) = {x,(*), x(t), my 2,(t)} 


(D—A"2(Q=0, i=1,2,°°+7 (11) 


we have 


Now for any complex function x(t) with continuous derivatives of mth 
order, we observe that 


(D — Ay™[e%a()] = e# D™[x(1)] (12) 


with 2 any constant. Equation (12) is trivial for m = 0; assuming (12) 
for m = k — 1, we have 


(D — )*e*a(t)] = (D — AD — Ay "e*a(2)] 
= (D — dA[e*D*-\(z)] = e* D*(x) 
Hence (12) is proved by induction. Now since (11) may be written as 


(D — A)™[e**{e**x,(1)}] = 0 
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by (12) we obtain 
e** D™{e-*ta(t)} = 0 (13) 


Canceling e** in (13) and integrating, we have 
a(t)=eMP(t) i= 1,2, + +n (14) 


where P,(t) (i = 1, 2,° - +,m) is a polynomial of degree < m— 1. This 
is precisely (9), and the proof is complete. 
Our general result now follows: 


Theorem 22. A fundamental system of solutions of (1) of the form (9) 
always exists; and if all the characteristic values 4, of the matrix are 
known, this fundamental system may be obtained by a finite number of 
elementary operations. 


Proof. The existence is trivial; for by Theorem 16 there exists a linearly 
independent set of n vectors Xj. of < associated with its characteristic 
values. If we define solutions X,(t) of (1) satisfying the initial conditions 
Xtc) = Xj, then these will be linearly independent and of the form (9). 

Let new A,, A,,° * +, A, be the characteristic values of 7, with respective 
multiplicities m,, m,° °°, m,. Let A be any one of them, with multi- 
plicity m. We shall try to satisfy (1) by functions of the form 


X(t) = {Pa(the, Pio(te*, > > +, Pin(t)e™} (15) 


where P,{t) are polynomials of the ith order with undetermined co- 
efficients, where i<i m— 1. First take i = 0, and substitute (15) in (1). 
The exponentials will all cancel and we get a set of linear algebraic equations 
in the undetermined coefficients of the P’s. This system may have no, 
or one, or several nontrivial linearly independent solutions. Performing 
the same process for i = 1, we equate coefficients of ¢ and constants to 
obtain 2n linear equations in the 2m undetermined coefficients. This 
system again will have a certain number of linearly independent solutions. 
Proceeding in this way up to i = m — 1, our theory proves that we shall 
obtain at least m solutions of which precisely m will be linearly independent. 
Then if we do the same for the other A4’s, we shall have constructed: a 
fundamental system. Thus we have a perfectly definite method by which 
a fundamental system of (1) may be obtained, and the process involves 
nothing more difficult than the solution of linear algebraic systems. This 
completes the proof. 

It is easily seen that the process just described is equivalent to finding 
n linearly independent vectors associated with the characteristic values of 
J. Now Theorem 16 may be made considerably more precise, and the 
process of finding the required vectors will be shortened thereby. This, 
however, would involve more algebraic theory than we can go into here. 
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17. Homogeneous Equation of nth Order 


We consider now the equation 


aa. &. de 
dt” + 25 dt"—* =e (1) 
where the a,’s are constants. We could reduce (1) to a linear system and 
treat it by the methods of § 16; it is, however, much simpler to treat (1) 
separately, and in addition we shall not need to assume Theorem 16. 
The relation to the methods of § 16 will be very obvious. 
Equation (1) may be written in the form 


P(D)x = 0 (2) 


where P(D) is a certain polynomial. The characteristic equation of (1) is 


P(A) =0 (3) 
Let A,, 42° * *, A, be its roots, where 
P(A) = (A = Aya — Ag) + (A — Ay) (4) 
and 
k 
> m,=n 
i=1 
We may apply the same factorization to P(D) and write (2) as 
(D — Ay"(D — 2,)™+ + + (D — A)*x = 0 (5) 


Now it is clear that any function 2(f) satisfying 
(D — A,)"x = 0 (6) 


will also be a solution of (5). We have already integrated an equation 
of the form (6) in Theorem 21; and the most general solution of (6) is 


w(t) = Pp, _y(t)e*# (7) 


where P,(t) is a polynomial of degree k. Henge in particular the 2 
functions 


ase, j=l, 2--m, i=1,2°+k (8) 
are all solutions of (1). 


Theorem 23. The n solutions (8) of (1) are linearly independent and 
hence form a fundamental system of (1). 
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Proof. Any linear combination of the functions (8) can be written in 
the form 


k 
Dd Pite* 
i=1 
where A, 4A, if ij, and Pt) are polynomials in ¢. It will suffice 
therefore to prove that if 
k 
Y P(He =0 (9) 
i=1 


then 
Pt) =0, i= 1,2,-°°,k 


We give a proof by induction. The theorem is trivial for k = 1. 
Suppose it true for k = m— 1. Consider then the identity 


SPae'# =0, A, 44, if 1 4) (10) 
Let us divide by e*'. Then (10) becomes | 
"S Payee-tat + P,,(t) =0 (11) 
where A; — 1,, 40, i= 1,2,:--+:,;m—1. Differentiating (11) we have 
"S 4 + Pal) = 0 
where Q,(t) = P;'(t) + (A; — 4,,)PAt). Hence the Q’s are polynomials 


of the same degree as the P’s. After differentiating a suitable number of 
times, the term in P,,(t) will disappear and we have 


m—1 
Y Ri(te4s—*mt = 0 (12) 
t=1 


where by the same argument the R,’s are polynomials of the same degrees 
as the P’s. But since no two of the exponents (A; — 4,,) are equal, our 
induction applies, and all the R’s are identically 0. Therefore the P’s, 
which were of the same degree as the R’s, also are identically 0. The 
same of course holds for P,,(t); and this completes the induction. 


18. Applications 


As a simple example of our methods, we consider the equation 


2 


Ga tka + ote = FO) a) 
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where k and w are real constants. Equation (1) may be interpreted as the 
expression of Newton’s law for a particle with one degree of freedom, 
where 


(1) w*x represents a restoring force towards the position x = 0 pro- 
portional to the displacement. 


(2) 2k(dx/dt) (for k > 0) represents a damping force proportional to 
the velocity. 


(3) F(t) is a variable external force. 
If F(t) = 0, (1) becomes homogeneous and the characteristic equation 
is 
A? + 2k1A + w? = 0 (2) 
The cases where the roots of (2) are equal or one is 0 are of comparatively 


little physical interest. Otherwise, if k? > w?, the roots are real and 
distinct 


A= —k + VFB @, = —k— VP — wo 
and the general solution is 
= ce"! + cae! 
If k? < w*, the roots are complex 
A=—k+if, where f? = w* — kK? 
and the general solution is 


x = e*"(c, cos Bt + cy sin Bt) (3) 


By means of these results, we shall obtain the solution of the non- 
homogeneous equation (1), using both of the methods we have developed. 
First, using the method of variation of constants, we know that a particular 
solution of (1) may be written in the form 


% = ¢,(t)2, + co(t) Xe (4) 
where x, and x, are a fundamental system of the homogeneous equation 
“,=e "cos Bt, x = e-*sin Bt 

and c,(t) and c,(t) satisfy 
cy (ta, + (tz, = 0 


‘ ‘ ‘ , (5) 
Cy (t)ay' + cg (tty = F(t) 
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Solving (5) we have 


F 1 
q(t) = — ReD =— a sin BtF(t) ‘. 
6 
Co (t) = a5 = 50 cos BtF(t) 


Let us integrate (6) and substitute in (4). Then after some simplifications 
we obtain the particular solution of (1) 


x(t) = 5 i eke sin B(t — s)F(s) ds (7) 


Let us now construct Green’s function K(s, t) for (1). We know 
K(s, t) =0 fort<s 
K(s, t) = e-*“(c, cos Bt + c,sin Bt) fort>s 
[K(s, t),-s40 = 9, [Ki'(S, )hasto = —1 
The constants c, and c, are uniquely defined by the initial conditions 
1 1 
Cc, = = (e* sin Bs), cg = — (e*™ cos Bs) 
B —B 
Hence after some simplifications K(s, ¢) is seen to be 


K(s, t) = 0, t<s 


K(s, th = — sow sin B(t — s), t>s ®) 
Now the solution x(t) of (1) such that 
x(to) = 0, x'(t) = 0 
is known to be given by 
x(t) = -[ K(s, t)F(s) ds (9) 


But (9) is exactly the same as (7); thus our two methods have produced 
precisely the same solution. 

Observe that K(s, t) is here a function of t—salone. This is true in 
general for the Green’s function of any nth order equation 


L(x) + F(t) =0 
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provided the coefficients in L are constant. For if x(t) is a solution of 
L(x) = 0, so is x(t — t)). Hence we can write 


K(s, t) = K(s —s,t —s) = H(t —s) (10) 


In terms of our physical interpretation, H(t) represents the response of the 
system at any time to a unit impulse ¢ units of time previous. In our case 


H(t) = — j —* sin Bt 


Thus the magnitude of the response either approaches 0 or becomes 
unbounded as f¢ increases, depending upon the sign of k. 


Singularities of 
an Autonomous System 


PART A. 
INTRODUCTION 


1. Characteristic Curves 
We consider in this chapter a second-order system of the form 


dx dy _ 


Such a system, in which the independent variable ¢ does not appear 
explicitly in the functions on the right-hand side, is called autonomous. 
Let us assume that P(x, y) and Q(z, y) are defined in some domain D 
of the x-y plane, and satisfy a Lipschitz condition on both x and y in some 
neighborhood of every point of D. Then by the theory of Chapter 2, if 
fy is any number and (2p, Yo) any point of D, there exists a unique solution 
of (1) 
z=2t) y=~y(t) 

satisfying (2) 


U(ty) = 2%, Yb) =Y 
70 
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Furthermore we know that either 
(1) The solutions (2) are defined for all real values of t; or 
(2) If the solutions (2) are not defined for t > ¢, (say), then either 
(a) As t->t, — 0, the point (a(t), y(t)) approaches the boundary of D; 


or if D is unbounded, possibly 
(b) As t >t, — 0, either x(t) or y(t) or both become unbounded. 


Analogous alternatives arise if t is not defined for t < some number f. 
In any subdomain D’ of D in which P(, y) does not vanish, we may 
write (1) in the form 
dy _ Q(z, y) 
Se (3) 
dx P(a,y) y) 


This is simply a restricted direction field of the type studied in Chapter 1; 
whence we know that through any point of D’ there passes a unique 
integral curve of (3). Essentially the same theorem is true for the more 
general system (1), as we shall see immediately. 

Let us consider a solution (2) of (1), with the restriction that a(t) and 
y(t) are not both constants; this case will be treated later. Then (2) 
defines a curve with continuously turning tangent, which we call a char- 
acteristic curve or simply a characteristic of (1). We have immediately: 


' Theorem 1. Through any point of D there passes at most one 
characteristic. 


For let (2,(t), y:(t)), (%e(t), ye(t)) be solutions of (1) representing 
characteristics C, and C, of (1), with the common point (%, y). Then 
there exist values ¢, and ft, of ¢ such that 


(ty) = y(t) = %, alt) = Yalta) = Yo 
Consider the pair of functions 
(a(t + 4 — 4), w(t + tr — 4) (4) 


Then (4) is a solution of (1), since P and Q do not contain t i ca 
But since for t = ft, we have 

mit+h—-)=2(t), wlt+4—t) = y(t) (5) 
it follows from the uniqueness theorem that (5) holds identically in ¢. 
Since (4) represents the characteristic C,, C, and C, are identical, and the 
theorem follows. 


Of course, the theorem is not true in general unless (1) is utononipaa: 
as may easily be shown by examples. 
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It should be understood that the terms characteristic and solution are 
not synonymous; a characteristic is a curve in D, which may be represented 
parametrically by more than one solution of (1). For example, if (z(t), 
y(t)) represents a certain characteristic C, then for any f, (a(t — t), 
y(t — t))) is a different solution also representing C. It is clear from 
Theorem 1 that a// solutions representing C may be written in this form 
by a suitable choice of ft). Hence it follows that for every solution of 
(1) representing C, the direction of increase of t along C is the same. 
Therefore a characteristic is a directed curve; in our figures we shall 
represent the direction of increasing ¢ by an arrow. 

If there exists a solution of the form 


X(t) = Xp, y(t) = Y% (6) 


it is clear that no characteristic can pass through (2%, Yo). The point 
(Xp, Yo) is said to be a singularity of the system. The condition that (6) 
be a solution of (1) is that 


P(X, Yo) = 9, Q(X; Yo) = 0 (7) 


Conversely, if (7) holds for a point (2, yo), it is obviously a singularity. 
Therefore the singularities of (1) are the points at which P and Q both 
vanish; they are the only points of D through which no characteristic 
passes. All other points of D are said to be regular. 

We can interpret the situation kinematically in the following way. 
Consider the vector field defined by 


V(x, y) = (P@, ¥), QC, y)) (8) 


ie., at (x, y) the vector V(x, y) has a horizontal component P(z, y) and a 
vertical component Q(x, y). Then (1) defines the motion of a particle 
(x, y) in the plane by the condition that its velocity be at every point equal 
to the vector V(z,y). Then the characteristics of (1) represent the 
trajectories of the particle, i.e., those fixed paths along which it must 
move, independently of its starting point. The singularities represent its 
points of equilibrium, i.e., those points at which it is at rest. In this 
chapter we shall consider the type of equilibrium defined by certain simple 
kinds of singularities of the system (1). 

Our theorem implies as a corollary that no characteristic can cross 
itself. For if C: (x(t), y(t)) is a characteristic and there exist two values 
ft) and ¢, of ¢ such that 


X(to) = X(t), Yy(to) = y(t) (9) 
it follows that for all ¢ 
x(t, + t) = xt + 0), Y(t, + t) = y(t + 0) 
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That is to say, C is a closed curve. C will have a period h, where h is the 
least number such that 


at+h=cxt), ytth=y) 


The existence of closed characteristics will prove to be of fundamental 
importance in the theory of characteristics in the large of Chapter 5. 


2. Isolated Singularities 


From the results of § 1, we know concerning any characteristic C: 
(x(t), y(t)) that either: 


(1) x(t) and y(t) are defined for all real values of t; or 


(2) If a(t) and y(t) are undefined for t >), then as t>t, — 0 the 
point P: (x(t), y(t)) either approaches the boundary of D or becomes un- 
bounded. It is clear that these results are independent of the particular 
solution (x(t), y(t)) used to represent C. 


Let us now define a half-characteristic as the set of points of a char- 
acteristic C such that, for some representation (~(t), y(t)) of C, t > some 
number f). Then we have from the above a lemma: 


Lemma 1. If a half-characteristic is contained within a closed bounded 
region R in D, its representing solution is defined for all t > ty. This lemma 
will be of constant use in the next chapter. 


Consider now an isolated singularity (Xo, Yo) of 


dx dy 

dt a P(2, Y)s dt — O(x, y) (1) 
i.e., suppose there exists a circle (v — 2)? + (y — Yo)? < p? containing no 
singularity of (1) except (%p, ¥). For convenience, take (2p, yg) to be the 
point (0, 0). Suppose there exists a half-characteristic C: (z(t), y(t)) 
lying in some closed bounded region R that contains (0, 0) as its interior 
point. Then a(t) and y(t) are defined for all ¢ >t. We say that C 
approaches the singularity (0, 0) if 


lim z(t) = 0, lim y(t) = 0 
t+ 2 t+ 
or equivalently if 
lim [x*(t) + y*(e)]* = 0 (2) 
t+ 


In terms of point-set theory, the derived set C’ of C is the single point 
(0, 0). It is hardly necessary to state that the fulfillment of (2) does 
not depend upon the particular solution (z(t), y(t)) that we happen to be 
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using. In general, wherever we define properties of a characteristic by 
means of a representing solution, it will be understood that the property 
is independent of the solution chosen. 

If a half-characteristic C approaches the singularity, we shall say further 
that it enters the singularity if the radius vector from (0, 0) to a point of 
C has a limiting direction as t > 00; ie., if 


lim — (3) 


exists or diverges properly to +00. A sufficient condition that C enter 
the singularity is that the direction of the tangent to C at (x, y) have a 


(t 
limiting position as t-> 00; for by l’Hospital’s rule, if lim os exists 
t— 2 
; t : 
(or = +0) then lim e exists and has the same value. For most 
t+ 


singularities in which we are interested, this is a necessary condition as 
well (cf. § 10). 

In the preceding definitions, we may also consider limits as t > —, 
and consider the behavior of negative half-characteristics in the neighbor 
hood of a singularity. We shall in fact for mathematical purposes 
frequently replace t by —t; though not forgetting that for the physicist-t 
is generally time. 

We have already mentioned that a singularity of (1) may be represented 
as a point of equilibrium of a kinematical system. We wish to formulate 
precisely the idea of stability of such an equilibrium. This may be done 
in more than one way; the following definition is due to Liapunov: 


Definition 1. Let P: (2%, Yo) be an isolated singularity of (1). P is said 
to represent a state of stable equilibrium if corresponding to every number 
€ > 0 there exists a number 6 > 0 with the following property. Let C: 
(x(t), y(t)) be any characteristic satisfying 


V (X(t9) — Xo)? + (Y(to) — Yo)? < 6 (4) 
for some value f, of t. Then C is defined for all t > t) and 
V(a(t) — mF + YO -—HP <6 thSt<o (5) 


Roughly, we may say the condition is this: once a characteristic gets very 
close to the singularity, it stays close to the singularity. If every character- 
istic passing through some point of a certain circle 


(tr — XH) + (YY — YH)? < Pp? 


approaches P, then clearly the equilibrium is stable. 
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In this chapter we shall classify the singularities of the system (1) in 
terms of the Jocal behavior of the characteristics near them. To do this 
we shall first consider the case where (1) is a Jinear system. We shall then 
prove that in the general case we may under certain conditions replace 
P(x, y) and Q(x, y) by the linear terms of their Taylor expansions about 
(Xp, Yo), Without any qualitative change in the local geometric behavior of 
the characteristics. 


PART B. 
SINGULARITIES OF A LINEAR SYSTEM 


3. General 


Consider the linear system 


dx —~ dy 
Hoe + Oy yet (1) 


where a, b, c, and d are constants. The system (1) has a singularity at 
(0, 0); let us assume that it has no other singularity; i.e., that 


ab 
cd | ou 
The characteristic equation of (1) is 
a—A b | _ 4 oe 
? d—- = A? — (a+ d)A + (ad — bc) = 0 (2) 


Observe that 0 cannot be a characteristic value. We know how to 
determine the solutions of (1) from the roots A,, 4, of (2). We shall prove 
that the nature of the singularity (0, 0) of (1) is completely determined by 
the values of A, and A,. 

It will frequently be convenient to introduce new variables %, 7 by the 
affine transformation 


x= px 
: Pet gy where |? {| 40 (3) 
Y¥ =rx+ sy rs 
We shall then obtain a new system of the form 
di 
Ss AG 7 
dt eee A B 
where #0 (4) 
dy S 7 C D 
= Cx + Dy 


dt 
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It is well known that (4) will have the same characteristic values as (1). 
Indeed, in terms of matrices, 


fe ol- le alle all 


It is easy to see that all the properties of characteristics in which we are 
interested (their approaching or entering a singularity, their being closed 
or bounded, etc.) are invariant under affine transformation. The trans- 
formation will only have the effect of somewhat distorting the shape of 
the characteristics. We shall in every case choose the constants of (3) so 
that (4) will have as simple a form as possible; (4) will then be a canonical 
form of (1). 

We shall proceed to classify the singularities of (1) according to the 
following scheme: 

Case |. A, and A, are real and unequal. 

(a) A, and A, have the same sign. 


(b) A, and A, have different signs. 


Case Il. There is a double root. 


(a) The rank of (" . . 4 2 :) is 0. 


(b) The rank of 4 - : ae i) ie 
Case Ill. A, and A, are conjugate complex. 
(a) The real part of the ’s is not 0. 

(6) A, and A, are purely imaginary. 


4. Nodal Points 


We consider first Case I(a), where A, and A, are real, unequal, and of the 
same sign. For definiteness let us take A, << A, <0. (The case where 
the 4’s are positive will be obtained by substituting —¢ for ¢.) We shall 
first consider the general system 


dx dy 
Ao th oe EY (1) 


without reducing to canonical form. Then, as may be verified by sub- 
stitution into (1), two particular solutions are given by 


2, = Ae, y, = Be 


at, = Cer!, y, = Der 


At 


(2) 
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where A, B, C, D are definite constants and E = #0. The char- 


C.D 
acteristics represented by (2) are half-lines from the origin with different 
slopes. If we multiply the solutions (2) by —1, we have two more half- 
lines and the four characteristics so defined form, together with the origin, 
two straight lines. 

The most general solution may be written in the form 


x = c,Ae™ + c,Ce™! 


3 
y = c,Be™ + c,De# ©) 


with c, and c, arbitrary. Clearly, all the characteristics approach the 
origin, and the rectilinear ones enter it with slopes B/Aand D/C. Further- 
more, if in (2) we take c, 4 0, we have 


D+ a Bel — ant 
Ce 


lim —- = li —__—_—__—_— ] = 


too Mt) tm) oF yay 
C2 


(4) 


als 


Therefore all the characteristics enter the origin, and all the nonrectilinear 
characteristics are tangent to a certain one of the rectilinear characteristics 
at the origin. We have then Fig. 1. 


(0,9) 


Fig. 1 
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It is easy to verify that for any nonrectilinear characteristic (3) 


iin 2 


t+-—a@ x(t) - A 


In (3) take c, and c, positive. Then every ray from 0 between the recti- 
linear characteristics C, and C, meets (3) once and once only. Now the 
curve obtained from (3) by a similarity transformation 


Ht)=kx(t), y(t) = ky) (5) 


with k any positive constant, is also a solution of (1). Hence by expanding 
and contracting any characteristic C of the form (3) we obtain all the 
characteristics in the region between C, and C,, and similarly for the other 
three regions of the plane. This will in general be true for linear systems: 
one nonrectilinear characteristic defines a// the characteristics in a certain 
region by means of the similarity transformation. 

The singularity represented by Fig. 1 is called a nodal point; more 

generally, a nodal point is a singularity entered by all characteristics that 
start sufficiently close to it. In the present case there are four rectilinear 
characteristics making up (together with the origin) two straight lines; all 
the other characteristics are tangent to one of these lines at the origin. 
The equilibrium is certainly stable. 
- If now we replace ¢ by —?, we obtain the case 4, > 4, >0. Then 
our work will go through as before, except that all the arrows in Fig. | 
will point away from the origin. We may call this an unstable nodal 
point. 

Let us now take any two vectors along the rectilinear characteristics as 
unit vectors for an oblique co-ordinate system %, 7. To be more explicit, 
let x = AZ + Cand y = BE + Dy. Then Z and @ are an affine trans- 
formation of x and y, and (1) becomes 


dz s ay 2 
Sah, = hg 6) 


the canonical form of (1). If we interpret and 7 as being perpendicular 
co-ordinates, we are simply making an affine distortion of Fig. 1. Now 
the general solution of (6) is 


£ = ce", y = ce” (7) 


where c, and c, are arbitrary. The four semiaxes are rectilinear 


SINGULARITIES OF AN AUTONOMOUS SYSTEM 79 


characteristics, and every nonrectilinear characteristic is tangent to the 
y-axis at the origin. Finally, from any nonrectilinear characteristic we 
may “‘fill up” the first quadrant by a similarity transformation. We then 
have Fig. 2. 


Fig. 2 


Evidently, it is easier to work with the canonical forms of (1). We 
shall do this in the future, and merely indicate what will be the changes 
in our figures for the general case. 


5. Saddle Points 


We consider now the case where 4, and A, are of opposite sign, say 
Ag <0< A,. Then as in § 4 we may consider our system in the canonical 
form 


= = Az, os (1) 


The general solution is 
a=qe y=ce* (2) 
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The four semiaxes are all characteristics, directed as in Fig. 3. If in (2) 
we take c, = c, = 1, for the characteristic so defined we have: 


ast 0,x%-—+- 0, y¥—>0 


as f-> — 00, 7-0, y-> & 


We may “‘fill up” the first quadrant and obtain Fig. 3. 


y) 


We thus have four rectilinear characteristics, asymptotes to all the others. 
If a half-characteristic starts on the y-axis, it will enter the origin along the 
y-axis; in any other case it will approach the z-axis at x = +00, Such 
a singularity is known as a saddle point; it represents an unstable equilib- 
rium. In the general case, the rectilinear characteristics will not be 
perpendicular, but the configuration will remain unchanged. 


6. Degenerate Nodal Points 


Although the case where the characteristic equation has a double root 
A is of less importance, we include it for the sake of completeness. We 
distinguish between two subcases. 
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(a) Suppose the rank of (" o A Pia ) is 0. Then b=c=0O, 


a = d = ji, and the system is already in a canonical form 


The solutions are all rectilinear characteristics 
zt=ce, y=c,e" 


which (if 4 < 0) all enter the origin as t > 00; see Fig. 4. 


Fig. 4 


This is a very simple nodal point. If 4 > 0 we have the same picture 
except that all the arrows point away from the origin. 

(b) Suppose the rank of (" . A ie ) is 1. Then there exists 
essentially a single solution of the form 


z= Ae, y= Be (2) 


Now it is known from the further theory of linear systems with constant 
coefficients (which we have not considered) that there is another solution 
of the form 

x=(At+Cye*, y= (Bt + De (3) 


From (2) it appears that there are two rectilinear characteristics forming, 
together with the origin, a single line. Any nonrectilinear characteristic 
may be written as a multiple of (3), and (if A < 0) 

y B 


lim 2 = — 


t+ wo & A 
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Therefore all the characteristics approach the origin tangent to one of the 
rectilinear characteristics. Since the characteristics are symmetric in 
the origin, we must have a picture of the form shown in Fig. 5. This is a 


Fig. 5 


degenerate nodal point. For 4> 0 we have the same figure but with 
arrows reversed. : 


7. Vortex Points 


Suppose now the characteristic values are complex. We may then set 
A, = a+ if, A, = a — if, where « = }(a + a), B? = —bc — [4a — d)}* 
> 0. By a suitable real affine transformation we may consider our 
system in the canonical form 


— = ax — By, = = px + ay (1) 


We consider first the case when « = 0, ie., A, and A, are purely 
imaginary. Then a solution of (1) is 


x=cosft y=sin£t (2) 


which represents a circle about the origin. By a similarity transfor- 
mation it appears that all the characteristics are circles with center at the 
origin. The picture in the general case will be an affine distortion of this, 
i.e., a family of ellipses with center at the origin, as shown in Fig. 6. 
Such a singularity is called a vortex point; in general a vortex point is 
one for which every characteristic in some neighborhood of the singularity 
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is closed and surrounds the singularity. Although no characteristic 
approaches the origin, it is clear from our definition that such a singularity 
is stable. 


Fig. 6 


Let the system again be written in the general form 


dx dy 

qm att hy ym cet dy (3) 
Then bc <0. Since for y = 0, dy/dt = cz, it follows that the direction 
of rotation of the characteristics is that of the sign of c, or the negative 
of the sign of b (counterclockwise rotation being considered positive). 


8. Spiral Points 


Suppose finally the characteristic values are complex but not purely 
imaginary (i.e., « 40). Then in the notation of §7 there is a solution 
of the form 

x = e* cos Bt, y = e* sin Bt (1) 


This is a logarithmic spiral about the origin, from which we may obtain 
all other characteristics by similarity. Such a singularity is known as a 
spiral or focal point. If « <0, it is clear that all the characteristics 
approach the origin as t — oo, but none enters it. The equilibrium is 
stable. 

If « > 0, all the spirals run away from the origin towards oo. Fora 
system in the form of (3) of § 7, the rule for direction of rotation is the 
same. The picture in the general case will therefore be that of a family 
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of affinely distorted logarithmic spirals, with the four possible con- 
figurations illustrated in Fig. 7. 


a<0 a<0 
e>0 ce <0 


a>0o a>o 
c>0 c <0 


Fig. 7 


- We have drawn only one spiral in each case. The reader will have to 
imagine the rest of the family of spirals in between the turns of the first 
one. 


9. Conclusion. Dynamical Interpretation 


We may then summarize our results in the following way: 
(1) A,, 4, real and of the same sign: nodal point. 

(2) A,, A, real and of opposite sign: saddle point. 

(3) A, A, purely imaginary: vortex point. 

(4) A, Ag complex: spiral point. 


As regards stability, by considering separate cases we arrive at the result: 
a necessary and sufficient condition that the equilibrium represented by the 
singularity (0, 0) of 


dx 
— = ax + by 
dt a b 
where #0 (1) 
dy j c d 
ape y 


should be stable is that both the characteristic values 4,, A, should have 
negative or zero real parts. 
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Before passing on to nonlinear systems, let us consider a special case 
of (1). In the second-order oe with constant coefficients 


d?x 
ae tue ~ + 4x =0 (2) 
let us as usual take @ = dx|dt as a new variable; we have then 
dx : dz : 
ao Be ao —gqu — 2kx (3) 


This is the special case of (1) when a=0, b=1. We assume q #0. 
If (2) is considered as defining the motion of a dynamical system, the 
characteristics of (3) represent the trajectories of (2) in the (x, 2) phase 
plane. The first equation of (3) shows that the characteristics of (3) have 
the following special properties: 

(i) A characteristic crossing the %-axis can do so only in the negative 
direction, i.e., from quadrant II to quadrant I, or from the quadrant IV 
to quadrant III. Hence if there exist closed or spiral characteristics, they 
must circle the origin in the negative direction. 

(ii) A characteristic crossing the z-axis must do so at right angles. 

We shall briefly enumerate the possible cases; the reader will have no 
difficulty in verifying our statements. 


Case l. k® >q; A,, A, real and distinct. Four rectilinear character- 
istics, of slopes A, and A,. 

(a) q>0; 4A, >0. Nodal point; all nonrectilinear characteristics 
tangent at the origin to the rectilinear characteristic whose slope has 
the smaller absolute value. 

(1) k>0. Stable equilibrium; rectilinear characteristics in quad- 
rants II and IV. 

(2) k <0. Unstable equilibrium; rectilinear characteristics in 
quadrants I and III. 

(b) gq <0; AA, <0. Saddle point; one rectilinear characteristic in 
each quadrant. 


Case Il. k® =q>0; A,, A, real and equal. Degenerate nodal point. 
Two rectilinear characteristics in opposite quadrants. 

(a) k>0. Stable equilibrium. 

(b) k <0. Unstable equilibrium. 


Case Ill. k®? <q; A,, A, conjugate complex. 

(a) k=0. Vortex point. Ellipses whose axes are the coordinate 
axes, 

(6) k>0. Stable spiral point. 

(c) k <0. Unstable spiral point. 
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The case where A, and /, are complex 


d*x dx 
represents damped oscillatory motion. Ifk = 0, the motion is harmonic. 
Ifk > 0, there is damping, and the amplitude of the oscillations approaches 
0. Ifk <0, the damping is negative and the oscillations become infinite. 
As an example of a saddle point, consider the motion of a pendulum 
about its point of unstable equilibrium, as in Fig. 8. The motion will 
be given by an equation of the form 


6+2k6—gsind=0, g>0, k>O0 


where we are justified (as we shall see in Part C) in substituting 6 for sin 
6, as far as discussion of the singularity is concerned. Then 6 = 0, 
6 = Oisa saddle point. If in the position of Fig. 8 
we give the pendulum a push of precisely the right 
magnitude, it will approach equilibrium as t — oo. 
This corresponds to one of the trajectories that 
approaches the origin. It is clear that these 
trajectories will very seldom be realized in nature. 

Finally, it should be observed that any second- 
order equation 


dx ( =) 
may be reduced to the form 
Fig. 8 dx, dz j 


This is a special case of the general nonlinear system of Part C. If F is 
defined for all x, %, the characteristics of (5) will have the properties (i) 
and (ii) mentioned after equation (3). 


PART C. 
NONLINEAR SYSTEMS 


10. Introduction 


We consider now the system 


dx dy _ 
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with an isolated singularity at a point which we may take to be the origin. 
Throughout this part we consider only singularities of the following type. 


Definition 2. The singularity (0,0) of (1) is said to be simple if the 
following condition is met: There exist constants a, b, c, d, where 


as 


such that if e,(x, y) and €,(x, y) are defined by 
P(x, y) = ax + by + (x,y) 


(2) 
O(x, y) = cu + dy + €,(x, y) 
then 
tim “@Y 9 j=1,2 (3) 
20 Va? + y? 
y->0 


It will appear presently that a simple singularity is isolated. A 
sufficient condition that (0,0) be simple is that P and Q should have 
first partial derivatives continuous in some neighborhood of the origin, 
with their Jacobian not vanishing at the origin. Then 


_ OP(O, 0) oP(0, 0) 9Q(0, 0) _ 99(0, 0) 
ET a Sag ra P| Sa 
and (3) is Taylor’s theorem. 
Now, we know the exact nature of the singularity of 
d: 
a = ax + by, OY ap doady (5) 


at dt 


from our previous discussion. It seems reasonable that the singularity 
of (1) should have essentially the same nature as that of the system (5) 
which is derived from it by taking the linear terms of the Taylor expansion 
for P and Q. We shall show that generally this is in fact the case. We 
consider the vector fields defined by (1) and (5), adopting as standard 
notation: \ 

Vix, y) = (P@, y), Q(, y)) 

V(x, y) = (ax + by, cu + dy) 

Lemma 2. A simple singularity (0, 0) of V is isolated. Asr = V x2 + y? 


approaches 0 in any manner we have: 


lim ==]; lim (arg V — arg V) = 0 7) 
ry [Pe 9 agen eee 


(6) 
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Proof. Since V is continuous and does not vanish on r = 1 
c = gb. |V7| = gb. [ax + by)? + (cx + dy)*}# > 0 
r=1 r=1 


Then for all r 
|V7| Ser (8) 
Since by (3) 
oe Ve? + 62 
fim a! = tim VT 9 (9) 
r—>0 r r>0 r 
we have 
7|_.. |v-P 
lim |= — | < lim = 0 10 
Pall 1Pi|S ate o 


From (10) it is clear that there is a neighborhood of the origin in which V 
cannot vanish, and the formulas (7) follow. We shall frequently use the 
lemma in this form: 

For « > 0 there is a circle r < ry about the origin in which 


<e, |largV—argV|<e (11) 


As in Part B, we shall generally perform an affine transformation upon 
x and y to reduce V to its simplest form. It is clear that this will not 
affect the validity of any of our work. 

For simple singularities we may state the following result regarding 
characteristics approaching a singularity: 


Theorem 2. Let C: (a(t), y(t)) represent a characteristic of V approach- 
ing the simple singularity (0,0). A necessary and sufficient condition that 
C enter (0, 0) with slope m is that 


_ ¥ Wt) 
lim —— =m 12 
ETE (12) 
where if m = +00 the limit must properly diverge. The only values m 
may have are the solutions of 
c+ dm _ 


a+bm oe 


where m = © is a Solution if and only if b = 0. 


Proof. The sufficiency of (12) we have already seen in § 2. Suppose 
then C enters (0, 0) with slope m 
Yt) 


ae x(t) = 
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Then we may write 
y(t) _ Q cu+dyte 
a(t) P ax+byt+e 


By (3) we have 
x y y 
. x(t) : rae : ones c+dm 
eG) ee ee rar (15) 
t+ addy pelea oi aero 


from (14). From the existence of the limit in (15) it follows that 


_ Y(t) y(t) : c+dm 
— = lim —, .c., t = 
in x(t) eee x'(t) aera a+bm 


This completes the proof. 

Equation (13) has at most two roots unless a = d, b = c = 0, when it 
is satisfied by all realm. Hence just as in the linear case there are at most 
two directions in which a characteristic 
can enter the origin, except for , 
the completely degenerate nodal point 
of §6. Theorem 2 of course holds V 
for linear systems as a special case. 5 +4 

Throughout the next three sections 
we shall adopt the following notation 
as standard. ¢(2, y) will denote the 0 x 
angle (with the proper sign) between 
the vector V(z, y) to a point A: (zx, y) 
and the radius vector AO; see Fig. 9. 

We make the definition unique by specifying 


—n<¢<t7 


Fig. 9 


(x, y) will denote the corresponding angle for V. Observe that p(k, ky) 
= y(z,y), k #0. For any e > 0 there exists a circle r< ry in which 
lé—- yl <e 

It will frequently be convenient to represent the characteristics of V in 
polar form 


r=r(t), 6 = A(t) 
where r(t) and 6(r) will then satisfy the equations 
do__|V|sing 
dt r 


(16) 
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11. Nodal Points 


We consider first the case where V has a nodal point, i.e., where the 
roots of 


are real, unequal, and of the same sign; say 4; <A, <0. Then by a 
suitable affine transformation we may take the vector fields in the canonical 
form: 

V= (Ar + (2, 9), Ay + €2(x, y)) 


4 = (A,2, Agy) 


(1) 


where 


From our conditions on 4, and A, it follows that everywhere |p| < 7/2 
(see Fig. 2 of this chapter), and since y is continuous and a function only 
of y/x, |y| has an upper bound y < 7/2. Also, in the open quadrahts 
I and Ill, 0 < p< wp < a/2 and in II and IV, —7/2 < —y< yp <0. 
See Fig. 10. , 


Fig. 10 


For any e > 0, there is a circle ': r<_r, in which V vanishes nowhere 
and |¢ — y| << «. Takee =4(407 — y). Thenin 


Isl<yte=s—e (2) 
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Let C: (r(t), 0(¢)), t > to, be any positive half-characteristic of V starting 
in r <r, expressed in polar form. Then 

dr dé _|V| sin @ 
—_—_—= = =_ V ) —_ = OO 
ai |[V| cos d 

Since by (2) cos ¢ > 0 in I’, r(t) is monotonically decreasing. Therefore 
C cannot get out of I’, and hence by Lemma | is defined for all ¢ > t. 
Since this is also true for any circle r << r, < r,, the equilibrium is certainly 
stable. Suppose C does not approach the origin. Then, since r(t) is 
monotonic and does not approach 0, it has a g.1.b. ry > 0. 


m<r<n t2h (4) 


(3) 


dt r 


But in ry <r <_1r,, |V| has a lower bound k > 0; therefore by (3) 


ee kos (2 
dt 7 ea 


r< —kcos (2 _ c)ce —t) +h, t> (5) 


Then for ¢ sufficiently large, r(t) < rp, contrary to our assumption. There- 
fore every half-characteristic starting in I‘ remains in I and approaches 
the origin. 

By Theorem 2 we see that a characteristic of V can enter the origin 
only along one of the co-ordinate axes. We shall prove first that every 
characteristic starting in I’ enters the origin; precisely: If a positive half- 
characteristic of V starting in \' does not enter the origin along the x-axis, 
it enters along the y-axis. 

Let 6 be any positive number. Let us surround the x- and y-axes 
with sectors of angle 6, and label the regions of the plane so defined as in — 
Fig. 11. 

Now it is easily verified that there is a p, > 0 which depends on 6 such 
that in all the B-regions, |y| > y,(6) > 0. Then for every 6 > 0, there 
exists a circle r< p(4) in which |¢ — y| < 4y,(6). Let us define 4,(6), 
B,(6), etc., as these parts of A,, By, etc., lying inr < p(6). Then in B,(6), 
|d| > 4y,(6) > 0; where in B,(6) and B,(6), ¢ > 0, and in B,(6) and B,(6), 
¢@ <0. Therefore on the rectilinear boundaries of A,(6) and A,(0), V 
points into a B-region; and on the rectilinear boundaries of A,(6) and 
A,(6), V points out of a B-region. Since no half-characteristic starting in 
r< p(6) can get out of it, we can make the following statements: 

(a) No characteristic starting in A,(d) or A,(6) can get out of it. 

(b) No characteristic starting in r < p(6), but not in A,(6), can get into 
A,(6); and likewise for A,(6). 
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Let now C: (a(t), y(t)) be a positive half-characteristic starting in 
r<r,. Then we observe: a necessary and sufficient condition that C 
enter the origin along the x-axis on the right is that for every 6 > 0 there 
exists a number 7(6) such that for ¢ > #(6), C lies entirely in A,(6). A 
similar remark holds for the other three possible methods of entrance. 


Fig. 11 


Suppose now that C is a half-characteristic starting in r< 7, which 
does not enter the origin along the x-axis. Then by the remark above, 
for some value do, C lies in one of the regions B,(6,); say without loss of 
generality B,(d)). Now assume that for every 6 < 6, C eventually enters 
A,(6). Suppose this were not so for some value of 6, say 6,. Then for 
all values of t > 4, say, C remains in B,(d,). But in B,(d,), by (3) 


dé i 
do _ |v sin? _ 9 
dt r 


since by construction ¢ was positive in B,(6,). Therefore 6(t) is mono- 
tonic, and since 0(t) < 4(7 — 6,) 
WT 


lim 0(t) = 0) < (6) 


tl «0 2 


This implies that C enters the origin along the line of slope tan 6), whereas 
we know that C can enter the origin only along one of the co-ordinate 
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axes. Therefore C enters every A,(6); and since it must stay in every 
A,{6), it enters the origin along the y-axis from above. Similar remarks 
hold if C is ever in one of the other B-regions. 

We have thus proved the following statement: Every half-characteristic 
starting in ': r <r, enters the origin, and every half-characteristic that 
for any value of ¢ is in one of the regions B,(6) or B,(6)[B,(6) or B,(6)] 
enters the origin along the y-axis from above [from below]. 

We prove now (by an elementary topological argument) that at least 
one characteristic enters the origin along the z-axis on either side. 
Suppose, on the contrary, that there is no half-characteristic entering the 
origin along the x-axis from the right. Then consider the set R = A,(6,), 
any do, the origin not included. R is known to be connected. Let M 
be the set of all points (v, y) of R such that the characteristic passing 
through (x, y) enters the origin along the y-axis from above, and N the 
set of points of R such that that characteristic enters the origin along the 
y-axis from below. Then 


M+N=R; M:-N=0; M,N#0 (7) 


Suppose now (2p, Yo) lies in R. Let C: (a(t), y(t)) be the characteristic 
passing through (%, y)) for ¢ = 0; then for t = 4, say, the point (a(t), 
y(t,)) lies in the interior of some B,(6), for some 6. Then by the con- 
tinuity of characteristics in their starting point, there exists some neighbor- 
hood Ng of (xo, Yo) such that every characteristic starting in N, enters 
B,6) and hence enters the origin along the y-axis in the same direction as 
C. That is to say, M and N are open in R. But this, together with (7), 
implies that R is not connected, a contradiction. Hence our assertion. 

If now two characteristics C, and C, enter the origin along the x-axis 
on the right, clearly any characteristic starting between them within r < r, 
also enters along the x-axis. Also, if we denote now by &M the set of all 
points (x,y) within r<r, such that the characteristic through (2, y) 
enters the origin along the y-axis, by the preceding argument M is open in 
R; wherefore its complement in R is closed. That is to say, of all 
characteristics that enter the origin along the z-axis, there is a “‘last” one 
on either side, in the following sense: if we consider the points of the 
circumference of r < r,, those through which the characteristic enters the 
origin along the z-axis are distributed in two closed arcs, as shown in 
Fig. 12. 

Thus we are certainly justified in calling (0,0) a nodal point. If 
sufficient further restrictions are put upon P and Q, it is possible to prove 
that there exists precisely one characteristic entering the origin along the 
a-axis on either side. The characteristics will then present precisely the 
same configuration as for a linear system (Fig. 13). The “normal” 
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direction of approach (in our previous case, the y-axis) corresponds to 
the root 4 of the characteristic equation with smaller absolute value. 


Fig. 12 


Fig. 13 


In the general case, the pictures will be affinely distorted but the con- 
figurations will remain unchanged. In the course of our work we proved 
that if r = 7, is a sufficiently small circle, then every characteristic inter- 
secting it must cross from its outside to inside. In the general case, these 
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circles will become ellipses with their axes in certain directions. They 
are the curves without contact of which we shall speak in the next chapter. 

Finally, if A, > A, > 0, the singularity will be an unstable nodal point, 
and as t -» — oo we have the same picture as before. 


_ 12. Saddie Points 


Suppose now the linear vector field V has a saddle point at (0, 0); 
then the characteristic values will be real and of opposite sign, say 


A<O0< A, 
We may then assume the canonical form 
4 = (A,2, Ay) 
V = (Ax +°&, Ay + €) . (1) 


The only directions along which a characteristic can enter the origin are 
the co-ordinate axes. 

The angle y between V and the radius vector AO (Fig. 14) will behave 
as follows: 

1. In quadrants I and III, y > 0. 

_2. In quadrants II and IV, py < 0. 
3. On the z-axis, yp = 0. 
4, On the y-axis, y = 7. 


Fig. 14 


It is clearly possible to surround the y-axis by a sector S of angle y > 0, 
such that in S, |y| > /2 + 2e, ¢€>0. Then in some circle r<7,, V 
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does not vanish (except at the origin) and |6 — y| < «. Let S, be the 
part of Sinr<7,; then in S, (Fig. 15), || > 7/2 + «. 

We prove that the origin represents an unstable equilibrium. Let C: 
(r(t), O(t)), t > t, be any half-characteristic starting in S,, where r(t)) = ro, 


y 


~—y-” 


Fig. 15 


say. Then C can only get out of S, by going out of r=7,; for on the 
rectilinear boundaries of S,, V points into S,. But in S, we have 


© = —|P| cos = —|V|c0s (3 + «] (2) 


and in that part of S, for which ry << r< 7, | y| has a lower bound k > 0. 
Therefore as long as C stays in S,, we have 


r(t) > —k cos (z ee a —t) + (3) 


Fig. 16 
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and C must eventually go out of S,. Therefore the equilibrium cannot 
be stable. 

It can be shown that any characteristic not entering the origin cannot 
approach it, and that any characteristic not approaching the origin must 


go out of the circle r<r,. We may also show (as in § 11) that at least 
one characteristic must enter the origin along the x-axis on either side, 
and that all such characteristics form a continuum. Similarly, there will 
be at least two characteristics approaching the origin along the y-axis 
for t= —oo. The singularity is therefore a saddle point, and we have 
a picture of the form shown in Fig. 16. Under further conditions on 
P(x, y) and Q(x, y) it can be shown that precisely two characteristics 
enter for t = — oo, with that of the linear case illustrated in Fig. 17. 


13. Spiral Points 


Suppose finally the linear system V has a spiral point; i.e., the char- 
acteristic values A,, A, are conjugate complex: say A = « + iB, «, B #0: 
For definiteness take « <0. We may then take the vector fields in the 
canonical form 

V = (ax — Py + &, Px + ay + €) 


V = (ax — By, Bx + ay) 


The characteristics of V are a family of logarithmic spirals approaching 
the origin as t-> 00. It is known that every radius vector will cut a 
logarithmic spiral in the same angle y, and since all the spirals are similar 
y will be a constant, |p| < 7/2. 


(1) 
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Take « = $447 — ||). Then for some circle r < ry 
. 7 
le-v<s lvl—e<l|¢l<lylt+e=5-« 


Let C: (r(t), 9(t)), t > t, be any half-characteristic starting in r< ro. 
Then 


dr do |V|sing 
— = —|V| cos 0, — = ——— > 0 2 
dt IV in dt r a (2) 
Therefore r(t) is monotonically decreasing, and C stays in r < ro, where- 
fore the equilibrium is certainly stable. If C does not approach the 
origin, we prove a contradiction as in the corresponding proof of § 11. 
Therefore every characteristic starting in r< ry approaches the origin. 
No characteristic can enter the origin, since (13) of § 10 has no real roots 
in our case. Since 6(t) is monotonic, by (2) it follows that lim 6(t) = 00, 
t+o 
i.e., that the characteristics circle the origin an infinite number of times. 
Therefore the origin is a stable spiral point. If we write the system 


without making an affine transformation 


Tm ar + by + & Ye cot dy te (3), 
the direction of rotation of the spirals will be the same as the sign of c. 
If «=}(a+d)>0, the same analysis will hold except that the 
equilibrium is unstable. 

Observe that here again the circles r=r,< rg are curves without 
contact; for the general system (3) they become a certain family of 
ellipses. 


14. Indeterminate Cases 


The cases where the linear system V has a degenerate nodal point are 
of minor importance, and we omit them. However, if V has a vortex 
point, the behavior of the nonlinear system is rather interesting. We may 
reduce to canonical form 


da dy 
a —By + a(x, ¥); a Bt + eal) (1) 
In polar form 

dr_1( dx 4) set 

parla tal Oe 

d6 1/ ay dx et — GY o 
Eg —— €. 

a aleG— Gg) ete 
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Therefore for r small, dr/dt is very near 0 and d6/dt is very near B. The 
nature of the singularity is indeterminate since we do not know the sign 
of dr/dt as in the previous sections. There are a number of possible 
cases: 


Case |. (0,0) may be a vortex point. This is true when (1) is linear. 
Consider also the equation 


d’y (4) = 
ie + at +y= (3) 
We may write (3) in the form 
dx dy 
—=—_ os — 2 — = 4 
dt Ms ge ® 


This is of type (1). Suppose in r < rg that d0/dt > 4. Consider the 
characteristic C of (4) passing through the point (2, 0), 2, > 0, for t = 0. 
Then for x, sufficiently small, C will remain in r < rg for |t| << 2 by 
virtue of (2) with «, = —2* and «, = 0. Therefore C will intersect the 
negative x-axis at some point (z,,0). But since (4) is unchanged if we 
replace t by —t and y by —y, it follows that the negative characteristic 
starting from (%,, 0) also passes through (1, 0), ie., that C is closed. 
Since this is true for all characteristics sufficiently close to (0,0), the 
singularity is a vortex point. 


Case Il. Consider now the equation 


a"y (2) 2 
i Aa ee 
1.e., the system 
dx dy 
Here 
dr__— xt" 
dor 


Therefore except on the y-axis, dr/dt < 0, and it is easy to see that no 
characteristic can be closed. By a familiar argument, it follows that every 
characteristic must approach the origin; and since d6/dt is positive and 
bounded away from 0 in some circle about (0, 0), the origin must be a 
stable spiral point. 
In the same way, the origin is an unstable spiral point of 
ay _ (2)’ - 
a Nay ee 
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Case I/II. Consider finally the system 


. =-—y+ ar sin= = P(x, y) 

- (6) 
ak Se me 

A = 2 + yr?sin™ = Q(x, y) 


This has no singularity but (0, 0). It is readily verified that P and Q have 
continuous first derivatives everywhere, including the origin. In polar 
form (6) becomes 


I (7) 


The circles r = 1/n (n = 1, 2,- : -) are closed characteristics. Further- 
more 


dr 
dr 1 1 


Fig. 18 
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Therefore no characteristic but the circles r = 1/n can be closed; for 
any characteristic must remain in one of the regions r > 1, 1/2m<r< 
1/Q2m — 1), 1/Q2m + 1)<r< 1/2m(m=1, 2,- ++); and in each of 
these regions r is strictly monotonic. Furthermore every nonclosed 
characteristic must either approach one of the circles as t-» +00 or 
become unbounded; for in every closed bounded region R of the plane 
not containing any point for which r = 1/n, |dr/dt| is bounded away from 
zero, and a characteristic starting in R must go out of R. Finally, non- 
closed characteristics must approach the circles spirally, since d0/dt = 1. 
The circles r = 1/n are stable or unstable accordingly as n is even or odd. 
See Fig. 18. 

Such a singularity, with both closed and nonclosed characteristics 
arbitrarily close to it, is known asa center. By ingenuity, very complicated 
centers may be constructed. 


oe 


Solutions of 
an Autonomous System. 
in the Large 


1. Introduction 


In this chapter we consider the geometrical picture presented by the 
characteristics of 


dx 


as dy _ 
dt = P(x, y), dt aes Q(z, y) 


over the entire domain D in which P and Q are defined. We shall assume 
an elementary knowledge of the geometry of the plane, including the 
Jordan curve theorem. Our main result will be the classical theorem of 
Poincaré-Bendixson on the existence of closed characteristics (“limit 
cycles”). To avoid confusion, we shall use the term cyclic characteristic 
to describe a closed characteristic curve representing a periodic solution, 
and use closed to denote topological closure only. 
Consider, for example, the system 


ony + ofl — ot 99, 


dy ae ae 
7 =—r+y1—-2-—y) () 


dt 
102 
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Transforming to polar co-ordinates we get 


dr 4 d0 
qe) a (2) 


It is easily verified that the most general solution of (2) is 


1 


a eed 6 = —(t — ty) (3) 
V1 + ke ; 
with k and 4% arbitrary. Let us set 4) = 0 for convenience; then every 

solution of (1) may be written in the form 


cos t —sin t 


Vigke® 7 Vigke® 


If k = 0, (4) is the circle 2? +y?=1. If k = +c, (4) represents a 
spiral approaching the origin as t-+ —oo, and approaching the circle 
spirally from the inside ast» +00. If k = —c?, (4) represents a curve 
approaching oo as t-> + loge + 0, and approaching the circle from the 
outside as t-» +00. We have thus a picture of the form shown in Fig. 1. 


(4) 


Fig. 1. 


Therefore there exists a single cyclic characteristic, which all the non- 
cyclic characteristics approach spirally as t-»-+00. Such a cyclic 
characteristic is known as a limit cycle after Poincaré. By the methods 
of this chapter, the existence of limit cycles may be proved for a general 
class of equations of physical interest. 
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2. Geometrical Considerations 


Let P(x, y) and Q(x, y) have continuous first-order partial derivatives 
in a certain domain D, and let V(x, y) = (P, Q) be the vector field they 
define. 


Definition 1. A segment without contact (with respect to V) is a finite 
closed segment L of a straight line, such that 

(a) Every point of L is a regular point of V. 

(b) At no point of L does the vector V have the same direction as L. 


That is, a segment without contact passes through no singularity of V 
and is tangent to no characteristic of V. We proceed to develop some 
elementary properties of such segments. 

(i) Through any regular point A: (Xp, y¥)) of D there may be drawn a 
segment L without contact containing A in its interior, and L may have 
any direction except that of V(2p, Yo). 

(ii) Every characteristic intersecting a s.w.c. L actually crosses from 
one side of L to the other, and all characteristics crossing L do so in the 
same direction. 

The proofs of (i) and (ii) follow immediately from the continuity of V ‘ 
and the uniqueness of characteristics. 

(iii) If C: (x(t), y(t)), a< t <b, is a finite arc of a characteristic of V, 
then C cannot cross a given segment without contact Z more than a finite 
number of times. 


Proof of (iii). For let {t,} (2 = 1, 2,- + +) be an infinite sequence of 
different values of t, a<t, <b, representing points {A,} of the inter- 
section of L and C. Then {t,} has a subsequence converging to some 
number 4, @<ty<b; without loss of generality we may take this 
subsequence to be {f,} itself. Let Ay be the point of C given by t = 4; 
clearly A, ison Z. Then the sequence {A,} approaches Ay. If more than 
one point A, is the same as Ag, then C is a cyclic characteristic of period 
h>0. Hence if an infinite number of A, coincide with Ag, then b — a 
must exceed any multiple of h, which is impossible. Therefore we may 


take every A, different from Ay. Then {4,4,} is a sequence of secants 
whose limiting direction is the tangent to C at Ay. But this limiting 
direction is the direction of L; wherefore L is tangent to C at Ap, a con- 
tradiction. Hence there can be at most a finite number of intersections 
of C and L. 


(iv) Let A be an interior point of a segment L without contact. Then 
for every « > 0 there exists a circle I‘ with center A, such that every 
characteristic passing through a point of I for t= 0 intersects L for 
t = some number fg, where |f)| < . 
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Proof of: (iv). Let A be the origin of (x, y) co-ordinates and L be on the 
x-axis, The solutions (x(t, X, Yo), Y(t, Zs Yo)) Of 


dx dy 
dt ie P(a, y), dt = Ox, y) 
have continuous first derivatives with respect to t, 7%, Yo in some sphere 
1? + x2 + Yo? < p%. Moreover [dy(0, 0, 0)]/dt #0 by the definition of 
a segment without contact. Therefore by the implicit function theorem 
the equation y(t, 2%, ¥o) = 0 has a unique continuous solution t = ¢(%, Yo) 
for (X,Y) in some circle I’ about (0,0). Since 40,0) =0, for T 
sufficiently small we shall have |t(2, ¥o)| < ¢€ for (xq, Yo) in I’, and the 
proof is complete. 

Suppose now that C: (x(t), y(t)), t > to, is a positive half-characteristic 
of V, lying in a closed bounded region R© D. Then by Lemma 1, 
Chapter 4, C is defined for all ¢ > %. 

We shall say that a point (2%, Y) is a limit point of C if there exists a 
sequence {t,} —> oo such that the sequence of points {x(t,), y(t,)} converges 
to (Xp, Yo). C must have at least one limit point; for the sequence of 
points {x(t +n), y(t) +m} (n= 1, 2,-- -), being contained in the 
closed bounded region R, has a convergent subsequence. The set of all 
limit points of C is denoted by C’; clearly C’ < R. 


"3. Basic Lemmas 


We assume throughout this section and the next that C: (z(t), y(¢)), 
t >to, is a positive half-characteristic contained in a closed bounded 
region RC D. 

Lemma 1. The limit set C’ of C is closed and connected; and if d(M; N) 


is the distance between the point sets M and N, we have 


lim d{(x(9), y(t); C’] = 0 (1) 


Proof. Suppose {A,} is a sequence of points of C’ converging to the 
point A. Then there exists a sequence {t,}, where ft, >, and 


Alt), Y(t), And <~ 


Then the sequence {x(t,), y(t,)} converges to A, whence A is a point of 
C’; wherefore C’ is closed. 

Suppose C’ is not connected. Then we can represent it as a sum of two 
disjoint nonnull sets M and N, each closed in C’ 


C'’=M+N; M-N=0; M,N #409 (2) 
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Since C’ is closed M and WN are closed, and since M and N are disjoint, 
closed, and bounded, they have a nonzero distance 6. Now there exists 
a monotonic sequence {t,} —> oo such that 


withnodd, = d[{z(t,), y(t,)}; M] < : 
(3) 


withneven, d[{x(t,), y(t,)}; NI< ‘ 


Therefore by the continuity of x(t) and y(t) there exists a sequence {t,’} 
such that 
bon—1 < t, < lon aes l, 2, Tee al {x(t,’), Y(tn’)} 3 M) 
= df(t), yta)}3 NI 


hence 


altatt,), Wiad}: (M+ NSS (a 


The sequence of points {z(t,’), y(t,')} must have a subsequence con- 
verging to a limit point (Z, 9) of C’; but by (4) 


dG, p); M+ N)>5 (5) 


a contradiction. Therefore C’ must be connected. 
Finally, if (1) does not hold, there must exist a number 6 > 0 and a 
sequence {t,} — oo such that 


d{{x(t,), ¥t)}s C'S 


Then as from (4) above, a contradiction is easily found; thus all parts of 
the lemma have been proved. 


Lemma 2. If C’ contains a regular point A: (%, ¥), the characteristic T 
through A lies entirely in C’. 


Proof. Let (x(t), g(¢)) represent I, where 


(2(0), 7) = 9) (6) 
Let {t,} —> co be such a sequence that the sequence of points 
{x(t,), Y(tn)} > & 9) (7) 


Define new representations of C by 


(2a(t), Yat) = (HE + ty), Yt + F,)) . (8) 
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Then from.(6) and (7) we have 
(x,,(0), y,(0)) — (€(0), 9(0)) (9) 


Now let (Z(t), J(t)) be any point of I’. It follows from (9) by the 
continuity of solutions in their initial values that 


(2(to + tn)» Ylto + tn) = nto)» Yn(to)) > (Eo), G(t0)) (10) 


Since fy + t, —> ©, (X(ty), J(to)) is in C’ and the lemma follows. 

Therefore C’ consists of a set of singularities and a set of complete 
characteristics. Observe that as yet we have not used the topology of the 
plane in any way; our results in fact hold for a vector field in n-dimensional 
space. The following lemma, however, depending as it does on Jordan’s 
curve theorem, is peculiar to the plane. 


Lemma 3. Suppose C’ contains a regular point A. Then if L is a 
segment without contact through A, there exists a monotonic sequence 
{t,} > 00 such that the points of intersection of C and L are precisely 
{A,} = {x(t,), y(t,)}. If Ay = Ag, then A = A, (n = 1, 2,: + -) and C is 
cyclic. If Ay # Ag, all the A,’s are different and on L, and A,,,, is between 
A,, and An, (n = 1, 2,° + °). 


Proof. By §2(i) there exists a segment Z without contact passing 
through A. Now every circle about A contains points of C corresponding 
to arbitrarily large values of t. Therefore by § 2(iv) L is cut by C in 
points corresponding to arbitrarily large values of f; i.e., there are an 
infinite number of values of t corresponding to points of intersection of 
Cand L. But by § 2(iii) any finite arc of C can cut L in only a finite 
number of points. Therefore the points of intersection may be represented 
by an infinite monotonic sequence {t,}—» 0o as required. Let {A,} be 
defined as in the statement of the theorem. If A, = A,, C is cyclic, 
A, = A,, and since A is an accumulation point of the A,’s, A = A; = A,,. 
It remains, therefore, to prove the last statement; and only in this part of 
the lemma is the topology of the plane essential to the proof. 

Suppose then A, # Ay. C does not intersect L for t; < t < t,; there- 
fore that part of C given by t, << fy, plus the segment A,A,, forms a 
simple closed Jordan curve [. There are two cases: 


Case |. For t >t, C is inside [’ as shown in Fig. 2. Then to get 
outside [, C must cross I, by Jordan’s theorem; but C cannot cross 
itself, and it cannot cross A,A, in the wrong direction [cf. § 2(1i)]. There- 
fore C remains inside [' for allt > t,. Hence it is clear that A, is different 
from A, and Ag, and that A, is between A, and A3. The statement of the 
lemma follows by induction. 
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Case Il. For t > t,, C is outside T as shown in Fig. 3. Then C cannot 
again get inside I’, and the argument proceeds as before with obvious 
modifications. 


Ai] Ag 


Fig. 2 


Fig. 3 


It follows that the sequence of points {A,,} has only the one accumula- 
tion point A. From this remark we have immediately: 


Lemma 4. No segment L without contact can cut C’ in two distinct 
points. 


Lemma 5. If C’ contains a cyclic characteristic F, it contains no other 
point. 


Proof. Suppose on the contrary that for some C, C’ contains a cyclic 
characteristic F and at least one other point. C’ — Fis not null, and 
hence cannot be closed; for otherwise C’ would be the sum of two 
closed disjoint sets (C’ — F and F) and hence not connected, contrary to 
Lemma 1. Since C’ is closed, there exists a point A on F which is an 
accumulation point of points of C’ — F. Let L be a segment without 
contact through A. Since every circle about A contains a point of 
C’ — F, by § 2(iv) it is clear that L must be cut by a characteristic 
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belonging to C’ — F. Therefore L cuts C’ in two distinct points, contrary 
to Lemma 4. This completes the proof. 


4. Theorem of Poincaré-Bendixson 


We now have all the materials to prove our main results. We assume 
as before that C: (a(t), y(t)), t >t, is a positive half-characteristic 
contained in a closed bounded region R © D. 


Theorem 1. (Poincaré-Bendixson). If C’ contains no singularity of V, 
then either: 


(1) C (= C’) is a cyclic characteristic, or 
(2) C’ consists of a cyclic characteristic F (a “‘limit cycle”) that C 
approaches spirally from either the inside or outside. 


Proof. Let A be a point of C’, and F the characteristic through A. 
Let F’ be the limit set of F. Since C’ is closed, F’ < C’. F’ contains at 
least one point B which (being in C’) must be regular. Let L be a segment 
without contact through B. By Lemma 4, L cuts F in precisely one 
point (namely B). Therefore by Lemma 3, F is cyclic. By Lemma 5, 
C’ = F, wherefore in any case C’ consists of the single cyclic characteristic 
EF 
‘If C is cyclic, C = F and we have Case I of the theorem. If C is not 
cyclic, C is either inside or outside F. Let A be a point of F, L a segment 
without contact through A, and {A,} (n = 1, 2,- - -) the sequence of 
points of intersection of L and C, corresponding to the monotonic 
sequence {t,} — oo (see Lemma 3). Then by simple arguments based on 
Jordan’s theorem, we see that according as C is inside or outside F we 
have the configurations shown in Fig. 4. Therefore the approach is 
certainly spiral. 

We remark further that by considerations of continuity it is not too 
hard to show that 


lim (ta — ty) = A () 


where h is the period of F. Thus the motion on C as too differs 
arbitrarily little from the periodic motion represented by F. Finally, if 
we replace t by —/, we may obtain analogous results as to the behavior 
of characteristics when t > — oo. 

In the course of our work we showed that if C’ contains a point of C, 
C is necessarily cyclic. This is not true for higher dimensions and the 
analogue of the Poincaré-Bendixson theorem does not hold. 
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Fig. 4 


Theorem 2. Suppose the singularities of V are isolated and C’ contains a 
singularity A. Then 

(1) If C’ contains no regular point, C’ consists solely of the point A, 
which C approaches as t > 00. 

(2) If C’ contains a regular point, C’ consists of a finite set of singularities 
{A,} and a set of characteristics {C,}, where as t > + © each C, approaches 
a singularity. 

Proof. If C’ has no regular point, being connected it can contain only 
A, and by Lemma 1, C approaches A as too. If C’ contains a 
regular point, C’ consists of a set {A,} of singularities and a set {C,} of 
characteristics, where {A,} must be finite since the region R in which C’ is 
contained is bounded and closed. If any C,’ contains a regular point, 
C, must be cyclic as in the proof of Theorem 1; which by Lemma 5 
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contradicts the assumption that C’ contains a singularity. Therefore no 
C,' contains a regular point; hence every C, approaches a singularity as 
t — +00, and likewise as t -~ —oo. 

Suppose now that there exist two concentric closed curves C, and Cy, 
bounding an annulus A free from singularities, and that on C, and C, 
V everywhere points into A. Then a characteristic starting in A must 
remain in A, wherefore there exists at least one limit cycle Fin A. By 
§ 5 it will follow that F must loop 4, i.e., that its interior cannot lie wholly 
in A. 

By constructing such curves Liénard and Levinson have shown the 
existence of limit cycles for quite general equations. Details of their 
methods and further references will be found in the Brown University 
notes on Non-Linear Mechanics.* 


5. Poincaré’s Index 


Consider a simple closed Jordan curve C in the (x, y) plane passing 
through no singularity of V = (P, Q). C may be represented in the form 


z=2(s), y=ys); (<s<s,) (1) 
2(Sy) = x(5,), y(So) = (sy) (2) 


where x(s) and y(s) are continuous (sy << s < s,), and (2) does not hold for 
any other pair of values of s. We can stipulate further that as s increases C 
be described in the positive direction. 

It can be shown without difficulty that there exists a unique function 
@(s), continuous for 55 <5 < s,, and satisfying 


—7 < WS) <7 
O(x(s), y(s)) 
VP? + OF (3) 


P(x(s), y(s)) 
VP +o 


We then define the index of the curve C (with respect to V) by 


6(s) — (Sy) 
2a 


sin 0(s) = 


cos 0(s) = 


(C) = (4) 
Clearly I(C) is an integer. 


* K. O. Friedricks, P. LeCorbeiller, N. Levinson, and J. J. Stoker, Non-Linear 
Mechanics (mimeographed notes), Brown University, Providence, 1943. 
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It can further be shown that J(C) is independent of the parametric 
representation (1) of C. It has the following topological meaning. Let 
us regard 

P=P(x,y), Q= O(x,y) 


as a many-one mapping of the z-y plane onto the P-Q plane. Under this 
mapping C goes into a certain closed curve C’ not passing through the 
origin of the P-Q plane. Then J(C) is the number of times C’ circles the 
origin in the positive direction. 
The following properties of the index, though by no means trivial to 
prove, are fairly obvious intuitively: 
(a) A simple closed curve C containing in its interior no singularity of 
V has index 0. 
(b) A cyclic characteristic F of V has index 1. Hence it follows that a 
cyclic characteristic must contain at least one singularity. 
Suppose now C, and Cy, are 
Cy two concentric curves containing 
no singularity of V between them. 
Then they have the same index. 
For by introducing a suitable cut L- 
=e joining C, and C, (Fig. 5), we can 
define a closed curve I’ containing 
no singularity whose index is there- 
fore 0. But since the changes in 0 
along Z and back cancel, we have 


Fig. 5 UP) = (Cy) — (C) 


i.e., C, and C, have the same index. 
In particular, if A is an isolated singularity of V, every simple closed 
curve about A containing no singularity but A will have the same index. 
This common index we call the index of A. It can be shown without 
difficulty that if all the singularities of V are isolated, the index of any 
curve C is equal to the sum of the indices of the singularities it contains. 
The formula of function theory giving the number N of zeros of an 
analytic function f(z) inside a curve C as a certain contour integral 


i ff@O@, 1 
= [, 7@ dz = oe Ag arg (f(2)) 


is a special case of this result. 
If F is a cyclic characteristic of V, and all the singularities of V are 
isolated, then F contains a finite number of singularities the sum of whose 


indices is 1. 
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6. Orbital Stability of Limit Cycles 


By Theorems 1 and 2, if C is a positive half-characteristic remaining in 
a closed bounded region R < D and C’ contains one regular point, then 
C approaches C’ spirally from either the inside or outside. We shall now 
prove that every characteristic starting sufficiently near C’ on the same 
side as C will also approach C’ spirally. 


Theorem 3. Let C approach its limit set C’ spirally from the inside 
[outside], and let C’ contain a regular point A. Then for «> 0 there 
exists 6 > 0 such that every positive half-characteristic starting inside 
[outside] C', and distant < 6 from C’, remains distant <e from C’ and 
approaches C' spirally. 


Proof. Since the singularities of V are isolated, there exists an 7 > 0 
such that there is no singularity not in C’ but distant < y from it. Take 
«<7. Let L be a segment without contact through A. Then there exist 
two points of intersection A, and A, of C and L, such that the part of C 
between A, and A, is everywhere distant < « from C’. Take 6 as the 
minimum distance of the arc AA» from C’. Then any characteristic I 
starting at a point distant < 6 from C’ and on the same side as C is in 
the region R bounded by C’, A,Ap, and A, A>. But I cannot get out of 
‘R, wherefore I’ < R. Suppose I‘ does not have a limit point on C’. 
Then I” is a limit cycle. If the interior of I” lies in R, there must be a 
singularity in R, a contradiction. Otherwise I’ must loop the doubly 
connected region R; which again is impossible, since C would have to 
cross I” to reach C’. Therefore I‘ must have at least one limit point 
on C’. If I approached a singularity of C’, C could not approach 
arbitrarily close to C’. Therefore I’ must contain a regular point B of C’. 

Let M be a segment without contact through B; then the intersections 
of C and [ on M alternate. Hence by an obvious argument C and [ 
have the same limit set, whence the theorem. 

Suppose in particular that C’ is a cyclic characteristic F. Then F is 
seen to be stable in a sense analogous to that of Liapunov for singularities. 
We are thus led to the following definition: 


Definition 2. A cyclic characteristic F is said to be orbitally stable if for 
every € > 0 there exists a 6 > 0 such that every positive half-characteristic 
starting at a point distant < 6 from F is defined for all increasing t and 
remains distant < « from F. 


(There are various other definitions of stability for characteristics, so we 
distinguish ours as orbital stability.) 
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Theorem 4. A necessary and sufficient condition that the cyclic character- 
istic F be orbitally stable is that for both the interior and exterior of F either: 
(1) A characteristic approaches F as limit cycle t + + ©, or 


(2) There exist cyclic characteristics arbitrarily close to F. 


Proof. From the preceding theorem, sufficiency immediately follows. 
Suppose F is orbitally stable and on the inside (say) there are no cyclic 
characteristics or singularities distant < e from F, for some «. Then a 
positive half-characteristic starting at a point distant < d(e) from F must 
have F as limit cycle. Hence the theorem. 


7. Index of Simple Singularities 
We shall finish this chapter by determining the index of any simple 
singularity and deriving certain applications. We observe first 


Lemma 1. If the simple closed curve C passes through no singularity 
of either of the vector fields V, and V,, and for no point of C 


larg V, — arg V,| =z, nan integer > 0 (1) 
then C has the same index for V, and Vy. 


For let 6,(P), 0.(P) (P on C) be the function of §5 for V, and V, 
respectively. Then setting 


$(P) = 9,(P) — 9,(P) 


since |A(P)| can never take on the value m7 (n = 1, 2,- - -), it follows that 
the change of ¢ around C which is a multiple of 27 must be 0 


Acd = Ac, — AcA, = 0 
Hence the lemma. 
In particular, let V(x, y) have a simple singularity at (0, 0), and V(x, y) 
be the corresponding linear field. Then for a sufficiently small circle 
about (0, 0) 


larg V — arg V| <5 
wherefore (0, 0) has the same index for both Vand V. We have therefore 
only to discuss the index of the singularity at (0, 0) of 
P(x, y) = ax + by a b 
Ole, y) = cx + dy e 
This will be the index of the circle C: r = 1. 


where 


| #0 (2) 
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Now PV cannot have the same direction at two different points (2,, y,); 
(a, Y2) on C; for otherwise we would have 


ax, + by, = k(ax, + by) 
ca, + dy, = k(cx, + dy), k>0 (3) 
ay? + yy? = 2,7 + ys” 


and the equations (3) have only the solution k = 1, x7 = 2%, y, = Yo. 
Parametrize C by 
x% = COS a, y = sin a, 0O<a<2n 


Then the angle 6 of V is a monotonic function 6(«). Hence C must have 
index either +1 or —1. 
But 6 decreases or increases with tan 6(«), and 


dy Q ccosa+dsina 
POSTING) 5 Bs cone Bane 
(4) 
ad — bc 
(a cos « + bsin a)? 


f'(~ = 


a 6 is positive. or 
cd P : 


Therefore f(«) and O(a) increase or decrease as 
negative. Hence the theorem: 


Theorem 5. A simple singularity has index —1 if it is a saddle point, 
+1 otherwise. 


For the condition that (0, 0) be a saddle point is precisely that 


a b 
eer ta 0 
Suppose now that the system : 
dx dy 


has only simple singularities. If these are all saddle points, there can be 
no cyclic characteristic. A cyclic characteristic must enclose an odd 
number of singularities, say 27 + 1, of which m are saddle points and 
n+ 1 are centers, nodes, or spiral points. Finally we are enabled to say 
that in certain regions there can be no cyclic characteristic. 
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